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Description 

CHROMOSOME I GENE AND GONE PRODUCTS 
RELATED TO ALZHEIMER^S DISEASE 

5 . 

Technical Field 

The present invention relates generally to Alzheimer's disease, and more 
specifically to methods and compositions for use in diagnosis and treatment of 
Alzlieinier's Disease. 

10 

Background of the Invention 

Alzheimer's disease (AD) is a devastating, neurodegenerative 
progressive disorder first recognized in 1907 (Alzheimer. Algemeinc Zeisschrift fur 
Psychiatric ^^:146-148, 1907). AD is a common disease in the elderly and is the 
1 5 predominant cause of dementia in people over 65 years of age. The prevalence of AD 
is estimated to be as high as 18.7% and 47.2% for the 75-84 year and > 85 year age 
groups respectively. Thus, there is a large affected population in most countries of the 
world. 

Clinical symptoms of the disease typically begin w^iih subtle short term 

20 memory problems. As the disease progresses, difficulty wath memor>', language, and 
orientation worsen to the point of interfering with the ability of the person to function 
independently. Other symptoms, which are variable, include myoclonus and seizures. 
Duration of AD from the first symptoms of memory loss until death is 10 years on 
average, but may range from 6-8 years to more than 20 years. AD always results in 

25 death, often from respiratory -related illness. 

The pathology in AD is confined exclusively to the central nervous 
system (CNS). The predominant features arc amyloid deposits (plaques) and 
neurofibrillary tangles (NFT). In AD, amyloid is found associated with the vascular 
system of the CNS and as focal deposits in the parenchyma. The major molecular 

30 component of amyloid is a highly hydrophobic peptide called the AP peptide. This 
peptide aggregates into filaments in an anti-fj-plcated sheet structure resulting in the 
birefringent nature of the AD amyloid. While AP is the major component of AD 
amyloid, a partial list of other proteins associated with the amyloid includes a-l-anti- 
chymoirypsin (Abraham, et al.. Cell 52:487-501, 1988), cathepsin D (Cataldo, et al., 

35 Brain Res. 573:181-192, 1990), non-amyloid component protein (Ueda. et al., Proc. 
Natl Acad Sci. USA 90:\ 1282-1 1286, 1993), apolipoprotein E (apoE) (Namba, et al.. 
Brain Res. 54 1:\63-\66, 1991: Wisniewski and Frangione, Neurosci, Lett. 
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/iJ:235-238, 1992; Strittmatter, et al.. Proc, NqL Acad Set. USA 1 977- 1 98 1, 1993), 
apolioprotein J (Choi-Mura, et al., Ac(a Neuropathoi 5J:260-264, 1992; McGeer» et a!.. 
Brain Res, J7P:337-341. 1992), heal shock • protein 70 (Hamos, cl al.. Neurology 
^7:345-350, 1991), complement components (McGeer and Rogers, Neurology^ 
5 ^i:44 7-449, 1992), a^.macroglobin (Strauss, cl al.. Lab. Invest. 66:223-230, 1992), 
intcrIeukin-6 (Strauss, el al.. Lab, Invest 66:223-230, 1992), proteoglycans (Snow, et 
al„ Lab. Invest. 5<^:454-458, 1987), and scrum amyloid P (Coria, cl aL, Lab. Invest. 
5^^:454-458, 1988). Surrounding many plaques are dystrophic ncurites, which are ner\'e 
endings containing abnormal filamentous structures. Plaques are often surrounded by 

10 astrocytes and activated microglial cells expressing immune-related proteins, such as 
the MHC class 11 glycoproteins HLA-DR, EiLA-DP, and HLA-DQ as well as MHC 
class 1 glycoproteins, inicrleukin-2 (IL-2) receptors, and IL-1. The other dominant 
lealure of AD neuropathology is the presence of NFTs. These consist oi' abnomial 
filaments bundled together in neuronal cell bodies, "Ghost" NFTs are also observed in 

15 AD brains, which presumably mark the location of dead neurons. Other 
neuropalhological features include granulovacuolar changes, neuronal loss, gliosis and 
the variable presence of Lewy bodies. 

In the AD brain, the destructive process of the disease is evident on a 
gross level. In the late-stage of AD, venU-icular enlargement and shrinkage of the brain 

20 can be observed by magnetic resonance imaging. On autopsy, extensive gliosis and 
neuronal loss are observed. Thus, (he amyloid plaque structures and NFTs observed at 
autopsy are most likely the end-points of a lengthy disease process, far removed from 
the initiating events of AD. Also, the cells remaining at autopsy are grossly different 
from tiiose of a normal brain. Neurons, which were possibly involved in initiating 

25 events, arc absent and other cell types, such as the activated microglial cells and 
astroctyes, have gene expression patterns not observed in the normal brain. Thus, 
atlempts using biochemical methods to identify key proteins and genes in the initiating 
steps of the disease are hampered by the fact that it is not possible to actually observe 
these critical initiating events. Rather, biochemical dissection of the AD brain at 

30 autopsy is akin to molecular archeology, attempting to reconsuuct the pathogenic 
pathway by comparing the normal brain to the end-stage disease brain. 

Substantial evidence has suggested that inherited genetic defects are 
involved in AD. Numerous early-onset kindreds have been described (Bird, et al., Ann. 
Neurol, 2J:25-31, 1988; Bird, ei al., Ann, NeuroL 25:12-25, 1989; Cook, et al., 

35 Neurology 2P:1402-I412, 1979; Feldman, el al., Neurology^ /i:81 1-824, I960; 
Goudsmit, J. Neurol ScL 49:19-, 1981; Heston and White, Behavior Genet. ^^:3 1 5-331, 
1978: Martin, et al.. Neurology^ ^7:62-68, 1991; Nee, et al„ Arch. Neurol. ^0:203-208, 
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1983; van Bogaeerl, et al., Mschr. Psychait. Neurol. /6?2;249-30K 1940; Wheelan, Ann. 
Hum. Genet. 25:300-309, 1959). (Early-onset is defined herein as onset prior to 65 
years.) Families with multiple late-onset AD cases have also been described (Bird, et 
al., Ann. Neurol. 25:12-25, 1989; Heston and White, Behavior Genet. 5:315-331, 1978; 
Pericak- Vance, el aL, Exp. Neurol. J 02:27 \ -279, 1988). In addition, twin studies have 
documented that monozygotic twins arc more concordant in their AD phenoiype than 
dizygotic twins (Nee, et al.. Neurology 37:359'363^ 1987; [133] ). Also, the families of 
concordant twins have more secondary cases of AD than families of discordant twins 
(Rapoport, el al.. Neurology -//;1549-1553, 1991). 

Genelic dissection of AD has been complicated by the complexity of the 
disease and the limited accuracy of its diagnosis. Because AD is common in the 
elderly, clustering of cases in a family may occur by chance, representing possible 
confounding non-allelic genetic heterogeneity, or etioiogic heterogeneity with genetic 
and non-genetic cases co-existing in the same kindred. In addition, the clinical 
diagnosis of AD is confounded with other dementing diseases common in the elderly. 

Despite the problems associated with resolving complex genetic 
diseases, 2 causative AD loci and 1 risk-modifying gene have been identified. 
Mutations in ihe amyloid precursor protein (APP) gene on chromosome 21 cause early- 
onset (< 65 years) autosomal dominant AD (Goale, et al.. Nature 349:704, 1991). 
Mutations in a recently identified gene (AD3) on chromosome 14 also result in early- 
onset autosomal dominant AD (Schellenberg, et al.. Science 258:66S, 1992; 
Sherrington, et aL, Nature J 75:754-760, 1995). For late-onset AD, the APOE gene has 
been identified as a genetic modifying factor (Strittmatter, et al., Proc. Natl. Acad Sci. 
USA 9():\971, 1993; Corder, et al.. Science 267:921, 1993; Cordcr, et aL, Nat. Genet. . 
7:180-184, 1994; Benjamin, et aL, Lancet 344:472, 1994; Smith, ct al.. Lancet 
344:473-474. 1994), 

The known genetic loci for AD do not account for all cases of AD. For 
example, in late-onset AD approximately half of AD cases do not have the APOE e4 
allele (Brousseau, el a!.. Neurology 342, 1994; Kuusisto, et al.. Brit. Med. J. 309:636. 
1994; Tsai, et al.. Am. J. Hum. Genet. 54: 643, 1994; Liddel, et al,. J. Med. Genet. 
J/; 197, 1994). Also, in the Volga German (VG) kindreds (Cook, et al.. Neurology^ 
29.1402, 1979; Bird, et al., Ann. Neurol 23:25. 1988; Bird, el al., Ann. Neurol. 25:12, 
1989; Bird, Am. Hist. Soc. Germ. Russia J. ^9:1991; Bird, el al., in Heterogeneity 0/ 
Alzheimer's Disease, F. Boiler, el al., Eds. (Spring- Verlag, Heidelberg, 1992) pp. 
118-129), as in several other families with high incidence of AD, the known AD loci 
have been excluded as possible causes (Schellenberg, et al.. Science 258:662, 1992; 
Lannfelt, et al., Nat. Genet. ^:2 18-2 1 9, 1993; van Duijn, et al.. Am. J. Hum. Genet. 
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Ji:7I4-727, 1994; Schellenberg, et al., Science 2^/;1507, 1988; Scheilenberg, et aL, 
Am. J. Hum, Genet 48:563, 1991 ; Scheilenberg, et al., ^m. J. Hum. Genet. ^9:511-517. 
1991; Kamino, et al.. Am. 1 Hum. Genet. 57:998, 1992; Schellenberg, el al.. Am J, 
Hum. Genet. 5J:6I9, 1993; [13]; Schellenberg, etaL,yi/m. Neurol 31:223. 1992; Yu, et 
5 a!.. Am. J. Hum. Genet 5^:63 i, 1994). Identification of new genes should add 
considerably to the unfolding of the genetic determinants and enable biochemical and 
genetic approaches to diagnosis and treatment. 

The present invention provides a novel, previously unidentified locus for 
AD, methods and compositions for diagnosis and treatment of AD, and further provides 
1 0 other, related advantages. 

Summary of the Invention 

Briefly stated, the present invention provides isolated nucleic acid 
molecules encoding an AD4 (also known as STM2) gene product. Within one 

15 embodiment, a representative nucleic acid molecule is provided in Figures lA and B 
(hereinafter referred to as Figure 1 ). Within other embodiments, nucleic acid molecules 
are provided which encode a mutant AD4 gene product that increases the probability of 
Alzheimer's Disease (in a statistically significant manner) One representative 
illustration of such a mutant is an amino acid subsitution at residue 141, wherein, for 

20 example, an isoleucine may be substituted for an asparagine. 

Within other aspects of the present invention, isolated nucleic acid 
molecules are provided, selected from the group consisting of (a) an isolated nucleic 
acid molecule as set forth in Figure I or complementary sequence thereof, (b) an 
isolated nucleic acid molecule that specifically hybridizes to the nucleic acid molecule 

25 of (a) under conditixins of high stringency, and (c) an isolated nucleic acid thai encodes 
an AD4 gene product. As utilized herein, it should he understood that a nucleic acid 
molecule hybridizes "specifically" to an AD4 gene (or related sequence) if it hybridizes 
detectably to such a sequence, but does not significantly or deleciably hybridize to the 
AD3 gene sequence under the same conditions. Within other aspects, isolated DNA 

30 molecules are provided that contain genomic sequences, such as the sequences 
presented in Figures 13-19. 

Within other aspects, expression vectors are provided comprising a 
promoter operably linked to one of the nucleic acid molecule described above. 
Representative examples of suitable promoters include tissue-specific promoters, as 

35 well as promoters such as the CMV I-E promoter, SV40 early promoter and MuLV 
LTR. Within related aspects, viral vectors are provided that are capable of directing the 
expression of a nucleic acid molecule as described above. Representative examples of 
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such viral vectors include herpes simplex viral vectors, adenoviral vectors, adenovirus- 
associated viral vectors and retroviral vectors. Also provided are host cells (e.g.. 
human, dog, monkey, rat or mouse cells) which carry the above-described vectors. 

Within other aspects of the present invention, isolated proteins or 
5 polypeptides are provided comprising an AD4 gene product, as well as AD4 peptides of 
greater than 12, 13, or 20 amino acids. Within one embodiment, a protein is provided 
that has the amino acid sequence set forth in Figure 2. Within another embodiment, the 
protein is a mutant ADA gene product that increases the probability of Alzheimer's 
Disease. Such mutaius include those with aji amino acid subsitution at residue 141 

10 (e.j?., an aspargine to isoleucine substitution). Within yet a further embodimenu AD4 
peptides are provided which are composed of 1 3 to 20 amino acids derived or selected 
from the N-terminal, internal, or carboxy -terminal hydrophilic regions. 

Within yet another aspect of the present invention, methods of treating 
or preventing Alzheimer's Disease are provided, comprising the step of administering to 

1 5 a patient a vector containing or expressing a nucleic acid molecule as described above, 
thereby reducing the likelihood or delaying the onset of Alzheimer's Disea,sc in the 
patient. Within a related aspect, methods of treating or preventing Alzheimer's Disease 
are provided, comprising the step of administering to a patient a protein as described 
above, thereby reducing the likelihood or delaying the onset of Alzheimer's Disease in 

20 the patient. Within yet another related aspect, methods are provided for treating or 
preventing Alzheimer's Disease comprising the step of administering to a patient an 
antibody specific for an AD4 protein as described above, thereby reducing the 
likelihood or delaying the onset of Alzheimer's Disease in the patient. Within certain 
embodiments, the above methods may be accomplished by in vivo administration. 

25 Also provided by the present invention are pharmaceutical compositions 

comprising a nucleic acid molecule, vector, host cell, protein, or antibody as described 
above, along with a pharmaceutically acceptable carrier or diluent. 

Within other aspects of the present invention, antibodies are provided 
which specifically bind to an AD4 protein, or to unique peptides derived from the N- 

30 terminal, internal, or carboxy-terminai hydrophilic regions. As utilized herein, it should 
be understood that an antibody is specific for an AD4 protein if it binds detectably, and 
with a of 10-^M or less (e.g., lO'^M, lO'^M, etc.), but does not bind detectably (or 
with an affinity of greater than lO'^M, (e.g.. lO'^M, lO'^M, etc.) to the AD3 protein. 
Also provided are hybridomas which are capable of producing such antibodies. 

35 Within other aspects of the present invention, nucleic acid probes are 

provided which are capable of specifically hybridizing (as defined above) to an AD4 
gene under conditions of high stringency. Within one related aspect, such probes 
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comprise at least a portion of the nucleotide sequence shown in Figure 1 or its 
complementary sequence, the probe being capable of specifically hybridizing to a 
mutant AD4 gene under conditions of high stringency. Within one particularly 
preferred aspect, probes are provided that are capable of specifically hybridizing to a 
5 mutant AD4 gene in which amino acid residue 141 is changed from asparagine to 
isoleucine. under conditions of very high stringency. Within other related aspects, 
probe are provided which are capable of specifically hybridizing to at least a portion of 
the nucleic sequence shown in any of Figures 13-19. Representative probes of the 
present invention arc generally al least 12 nucleotide bases in length, although they may 

10 be 14, 16, 18 bases or longer. Also provided are primer pairs capable of specifically 
amplifying all or a portion of any of the nucleic acid molecules disclosed herein. 

Within other aspects of the invention, methods arc provided for 
diagnosing a patient having an increased likelihood of contracting Alzheimer's Disease, 
comprising the steps of (a) obtaining from a patient a biological sample containing 

15 nucleic acid, (b) incubating the nucleic acid with a probe which is capable of 
specifically hybridizing to a mutant Al>4 gene under conditions and for time sufficient 
to allow hybridization to occur, and (c) detecting the presence of hybridized probe, and 
thereby determining that said patient has an increased likelihood of contracting 
Alzheimer's Disease. Within another aspect, methods are provided comprising the 

20 steps of (a) obtaining from a patient a biological sample containing nucleic acid, (b) 
amplifying a selected nucleic acid sequence associated with a mutant AD4 gene, and (c) 
detecting the presence of an amplified nucleic acid sequence, and thereby determining 
that the patient has an increased likelihood of contracting Alzheimer's Disease. Within 
yet another aspect, methods are . provided comprising the steps of (a) contacting a 

25 biological sample obtained from a patient with an antibody that specifically binds to a 
mutant AD4 protein, under conditions and for a time sufficient to allow binding of the 
antibody to the protein, and (b) detecting the presence of the bound antibody. Suitable 
biological samples include nucleated cells obtained from the peripheral blood, fi-oni 
buccal swabs, or brain tissue. 

30 Within another aspect, peptide vaccines are provied which comprise a 

portion of a mutant AD4 gene product containing a mutation, in combination with a 
pharmaceutical ly acceptable carrier or diluent. 

Within yet another aspect, transgenic animals are provided whose germ 
cells and somatic ceils contain an AD4 gene which is operably linked to a promoter 

35 effective for the expression of the gene, the gene being introduced into the animal, or an 
ancestor of the animal, at an embryonic stage. Within one embodiment, the animal is a 
mouse, rat or dog. Within other embodiments, the AD4 gene is expressed from a vector 
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as described above. Within yet another embodiment, the AD4 gene encodes a mutant 
AD4 gene product. 

These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In addition, 
5 various references are set forth herein which describe in more detail certain procedures 
or compositions {e.g., plasmids, etc.), and are therefore incorporated by reference in 
their entirety. 

Brief Description of the Drawin^s 
1 0 Figures I A and I B depict the nucleotide sequence of an AIM gene. 

Figure 2 depicts the amino acid sequence of an AD4 gene product. 
Figure 3 depicts the sequence alignment of a representative AD4 gene 
product to the AD3 gene product. Vertical bars between the sequences indicate 
sequence identity. Colons between the sequences indicate strong sequence similarity. 
15 Dots between the sequences indicate medium sequence similarity. Dots inserted into 
either sequence indicate gaps inserted into a sequence so as to maximize the score of 
the alignment. 

Figure 4 depicts a hydropathy plot of an AD4 gene product. The plot 
indicates that there are 7 putative transmembrane regions of this gene product, and 

20 indicates a mutation at residue 141 (asparaginc to isoleucinc). 

Figure 5 depicts certain sequence features of a representative AD4 gene 
product, including a conserved sequence of 4 amino acids (KPGL) in positions 202-205 
and 251-254, each of which lie at the C-terminus of a putative membrane spanning 
regions and are conserved in the human and mouse AD3 gene products. Figure 5 also 

25 depicts a putative glycosylation site (asparaginc 366). 

Figure 6 is a dotplot which compares the human AD3 gene product and 
a representative AD4 gene product. The dark blocks indicate sequence similarity 
between the putative membrane spanning regions. 

Figure 7 depicts a family tree showing the segregation of chromosome 1 

30 markers in the R family. The bars beneath each subject represent minimum 
recombinant haplotypes constructed using the 23 markers from DIS238 to DIS235 
(Table 2). Alleles shown (from top to bottom) are for D1S249, DIS237, D1S479. 
D1S439 and D1S225, where A is the largest published allele. The solid bar represents 
the haplotypes segregating with the disease. Other haplotypes are represented by bars 

35 with different hatching or marking. A partially missing bar indicates markers not 
genotyped (subject IV- 15), A " " symbol represents regions where the phase cannot be 
assigned. APOE genotypes are shown directly above the haplotypes with 3 and 4 being 
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the £3 and £4 alleles, respectively. Recombinants were observed on the disease 
haplotypes between D1S479/DIS439 and DIS225 in subjects V-1 and V-3, and 
between DIS306 and D1S310 in subject IV- 1 8 (not shown). In IV- 16 there is a 
recombination between DIS2I7 and D1S237. A total of 4 recombinations (3 on non- 
5 disease related chromosomes) were observed between DIS249 and D1S237 in the 27 
chromosomes in generation IV. The distance between these markers is 12 cM and the 
chance that up to 4 recombinants would be observed is 41%. 

Figure 8 depicts the location of chromosome I markers. Distances are 
given in centimorgans (sex -averaged) with the use of the Kosambi map function. The 3 
1 0 maps are not independent as all contain some common genotypes. 

Figure 9 sets forth the sequences of primers used to amplify 
polymorphic regions of chromosome 1, 

Figure 10 presents the DNA sequence of clone 788iih5, EST T()3796 
and clones EH 1.1 and ELL 1.2. 
15 Figure 11 presents the DNA sequences of primers used in PGR or 

sequencing of the AD4 gene sequence. 

Figure 12 presents an elhidium bromide stained gel of genomic DNA 
amplified with WWF and INTIR primers. Lanes are: K at-risk, 79 years; 2, affected, 
onset 60 years; 3, at-risk, 67 years of age; 4, affected, onset 75 years (presumed 
20 phenocopy); 5, Boehringer Mannheim DNA size standard V; 6, affected, onset at 52 
years; 7, affected, onset 46 years; 8, affected, onset 49 years. 

Figure 13 presents a partial genomic DNA sequence of AD4. 

Figure 14 presents a partial genomic DNA sequence of AD4. 

Figure 1 5 presents a partial genomic DNA sequence of AD4. 
25 Figure 1 6 presents a partial genomic DNA sequence of AD4. 

Figure 17 presents a partial genomic DNA sequence of AD4. 

Figure 18 presents a partial genomic DNA sequence of AD4. 

Figure 19 presents a partial genomic DNA sequence of AD4. 

Figure 20 shows expression pattern of STM2 in different tissues. 

30 

Detailed Description of the Invention 
DEFiNITIONS 

Prior to setting forth the invention in detail, it may be helpful to an 
35 understanding thereof to set forth definitions of certain terms and to list and to define 
the abbreviations that will be used hereinafter. 
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^' Genetic marker " is any segment of a chromosome that is 
distinguishably unique in the genome, and polymorphic in the population so as to 
provide information about the inheritance of linked DNA sequences, genes and/or other 
markers. 

5 " Autosomal dominant " means that a trait is encoded on one of the non- 

sex chromosomes (autosomes) and is dominant for the phenotype it dictates for an 
individual having a heterozygous state. 

" LOP score " is a standard measure in genetics of the likelihood of a trail 
being localized in the interval being scored. Is the logarithm of a calculated probability. 

10 " Vector " refers to an assembly which is capable of directing the 

expression of an AD4 gene, as well as any additional sequence(s) or gene{s) of interest. 
The vector must include transcriptional promoter elements which are operably linked to 
the genes of interest. The vector may be composed of either deoxyribonucleic acids 
("DNA"), ribonucleic acids ("RNA"), or a combination of the two (e.^'., a DNA-RNA 

15 chimeric). Optionally, the vector may include a polyadenylation sequence, one or more 
restriction sites, as well as one or more selectable markers such as neomycin 
phosphotransferase or hygromycin phosphotransferase. Additionally, depending on the 
host cell chosen and the vector employed, other genetic elements such as an origin of 
replication, additional nucleic acid restriction sites, enhancers, sequences conferring 

20 tnducibiiity of transcription, and selectable markers, may also be incorporated into the 
vectors described herein. 

Abbreviations : AD, Alzheimer's disease; APP, amyloid precursor 
protein gene; APLPl and APLP2, amyloid precursor like proteins; CNS, central 
nervous system; DS, Down syndrome; FAD, familial AD; HCHWA-D, hereditary 

25 cerebral hemorrhage with amyloidosis - Dutch type; (Zj^iax)^ maximum LOD score; 
NFTs, neurofibrillary tangles; STRP, short landem repeal polymorphism; 0, 
recombination fraction; VG, Volga German; AD3, the designation given to the 
chromosome 14 early-onset FAD gene; YAC, yeast artificial chromosome; EST, 
espressed sequence tag; PGR, polymerase chain reaction; RT-PCR, PGR process in 

30 which RNA is first transcribed into DNA at the first step using reverse transcriptase 
(RT); cDNA, any DNA made by copying an RNA sequence into DNA form. 

As noted above^ the present invention provides methods and 
compositions for the detection and treatment of Alzheimer's Disease. These methods 
35 and compositions are based upon the discovery of a gene which is located on 
chromosome U which when mutated increases the probability of Alzheimer's Disease. 
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This gene, designated AD4, was found to be located on chromosome 1 at locus lq3J- 



4 



Although one embodiment of an AD4 gene is disclosed in Figure I , it 
should be understood that the present invention is not so limited. In particular, within 
5 the context of the present invention reference to the AD4 gene should be understood to 
include derivatives, analogs, or allelic variants of the gene disclosed in Figure I that are 
substantially similar. As used herein, a nucleic acid molecule is deemed to be 
"substantially similar" if: (a) the nucleotide sequence is derived from the coding region 
Of the described gene and includes portions of the sequence or allelic variations of the 
10 sequences discussed above; (b) the nucleotide sequence is capable of hybridization to 
nucleotide sequences of the present invention under high or very high stringency {see 
Sambrook et al., Molecular Cloning: A Laboratory Manual. 2nd ed.. Cold Spring 
Harbor Laborator>' Press, NY, 1989); or (c)the DNA sequences are degenerate as a 
result of the genetic code to the DNA sequences defined in (a) or (b). In addition, the 

15 genomic sequence may contain only polymorphisms, either in the coding or non- 
translated region, or insertions, deletions and the like. Further, the AD4 gene includes 
both complementary and non-complementary sequences, provided the sequences 
otherwise meet the criteria set forth herein. Within the context of the present invention, 
high stringency means standard hybridization conditions (e.;?., 5x SSPE, 0.5% SDS at 

20 65^C, or the equivalent), such that an appropriate nucleotide sequence is able to 
selectively hybridize to nucleotide sequences from the AD-related gene, and to mutant 
nucleotide sequences. Very high stringency means the nucleotide sequence is able to 
selectively hybridize to a single allele of the AD-related gene. 

The AD4 gene may be isolated from genomic DNA or cDNA. Genomic 

25 DNA libraries constructed in vectors, such as YACs (yeast artificial chromosomes), 
bacteriophage vectors, such as XEMBL3, XgtlO, cosmids or plasmids, are suitable for 
use. cDNA libraries constructed in bacteriophage vectors, plasmids, or others, arc 
suitable for screening. Such libraries may be constructed using methods and techniques 
knov^ in the art {see Sambrook et al.. Molecular Cloning: A Laboratory Manual. Cold 

30 Spring Harbor Press, 1989) or purchased from conunercial sources {e.g., Clontech). 
Within one embodiment, the AD4 gene is isolated by PCR performed on genomic 
DNA, cDNA or libraries, or by probe hybridization of genomic DNA or cDNA 
libraries. Primers for PCR and probes for hybridization screening may be designed 
based on the DNA sequence of AD4 presented herein. The DNA sequence of an AD4 

35 gene is presented in Figure lA and B; and the predicted amino acid sequence is 
presented in Figure 2. Primers for PCR should be derived from sequences in the 5* and 
3* untranslated region in order to isolate a ftill-Iength cDNA. The primers should not 
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have self-complcmenlary sequences nor have complementary sequences at their 3' end 
(to prevent primer-dimer formation). Preferably, the primers have a GC content of 
about 50% and contain restriction sites. The primers are annealed to cDNA and 
sufficient cycles of PGR are performed to yield a product readily visualized by gel 
5 electrophoresis and staining. The amplified fragment is purified and inserted into a 
vector, such as XgtlO or pBS(M13-»-)» and propagated. An oligonucleotide 
hybridization probe suitable for screening genomic or cDNA libraries may be designed 
based on the sequence provided herein. Preferably, the oligonucleotide is 20-30 bases 
long. Such an oligonucleotide may be synthesized by automated synthesis. The 

10 oligonucleotide may be conveniently labeled at the 5' end with a reporter molecule, 
such as a radionuclide, {e.^.. ^^p) ^r biolin. The library is plated as colonies or phage, 
depending upon the vector, and the recombinant DNA is transferred to nylon or 
nitrocellulose membranes. Following denaturation. neutralization, and fixation of the 
DNA to the membrane, the membranes are hybridized with the labeled probe. The 

1 5 membranes are washed and the reporter molecule detected. The hybridizing colonies or 
phage are isolated and propagated. Candidate clones or PGR amplified fragments may 
be verified as containing AD4 DNA by any of various means. For example, the 
candidate clones may be hybridized with a second, nonoverlapping probe or subjected 
to DNA sequence analysis. In these ways, clones containing AD4 gene, which arc 

20 suitable for use in the present invention are isolated. 

The structure of the proteins encoded by the nucleic acid molecules 
described herein may be predicted from the primary translation products using the 
hydrophobicity plot function of, for example, P/C Gene or Intelligenetics Suite 
(Inielltgenetics, Mountain View, Calif.), or according to the methods described by Kyte 

25 and Doolittic {J. Mol Biol. 757:105-132, 1982). One such structure is depicted in 
Figure 4, which is a hydropathy plot of an AD4 gene product. As is evident from lliis 
figure, the plot depicts a protein with an extracellular domain, three extracellular loop 
domains, three intracellular loop domains, seven transmembrane regions, and an 
intracellular domain. 

30 AD4 proteins of the present invention may be in the form of acidic or 

basic salts, or in neutral fomi. In addition, individual amino acid residues may be 
modified by oxidation or reduction. Furthermore, various substitutions, deletions, or 
additions may be made to the amino acid or nucleic acid sequences, the net effect of 
which is to retain or further enhance or decrease the biological activity of the mutant or 

35 wild-type protein. Moreover, due to degeneracy in the genetic code, there may be 
considerable variation in nucleotide sequences encoding the same amino acid sequence. 
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Other derivatives of the AD4 proteins disclosed herein include 
conjugates of the proteins along with other proteins or polypeptides. This may be 
accomplished, for example, by the synthesis of N-terminal or C-terminal fusion 
proteins which may be added to facilitate purification or identification of AD4 proteins 
5 {see U.S. Patent No. 4,851,341, see also. Hopp ct al., Bio/Technology d:]204, 1988.) 
Alternatively, fusion proteins such as AD4.pgalactosidase or AD4-Iuciferase may be 
constructed in order to assist in the identification, expression, and analysis of AD4 
proteins. 

AD4 proteins of the present invention may be constructed using a wide 
10 variety of techniques including for example, that described in Example 5. Further, 
mutations may be introduced at particular loci by synthesizing oligonucleotides 
containing a mutant sequence, flanked by restriction sites enabling ligation to fragments 
of the native sequence. Following ligation, the resulting reconstructed sequence 
encodes a derivative having the desired amino acid insertion, substitution, or deletion. 

Alternatively, oligonucleoiide-directcd site-specific (or segment 
specific) mutagenesis procedures may be employed to provide an altered gene having 
particular codons altered according to the substitution, deletion, or insertion required. 
Exemplary methods of making the alterations set forth above are disclosed by Walder 
etal. {Gene ^2:133. 1986); Bauer etal, (Gene J7:73, 1985); Craik {BioTechniques. 
January 1985, 12-19); Smith etal. {Genetic Engineering: Principles and Methods. 
Plenum Press, 1981); and Sambrook et al. {supra). Deletion or truncation derivatives 
of AD4 proteins {e.g., a soluble extracellular portion) may also be constructed by 
utilizing convenient restriction endonuclease sites adjacent to the desired deletion. 
Subsequent to restriction, overhangs may be filled in, and the DNA relegated. 
25 Exemplary methods of making the alterations se! forth above are disclosed by 
Sambrook et al. (Molecular Cloning: A Laboratory Manual 26 Ed., Cold Spring 
Harbor Laboratory Press, 1989). 

Mutations, which are made in the nucleic acid molecules of the present 
invention preferably preserve the reading frame of the coding sequences. Furthermore, 
30 the mutations will preferably not create complementary regions that could hybridize to 
produce secondary mRNA structures, such as loops or hairpins, that would adversely 
affect translation of the mRNA. Although a mutation site may be predetermined, it is 
not necessary thai the nature of the mutation per se be predetermined. For example, in 
order to select for optimum characteristics of mutants at a given site, random 
35 mutagenesis may be conducted at the target codon and the expressed mutants screened 
for indicative biological activity. Alternatively, mutations may be introduced at 
particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked 



20 



BNSDOCID: <WO 9703I92A3_IA> 



wo 97/03192 PCT/US96/1 1386 

13 

by restriction sites enabling ligation to fragments of the native sequence. Following 
ligation, the resulting reconstructed sequence encodes a derivative having the desired 
amino acid insertion, substitution, or deletion. 

AD4 proteins may also be constructed utilizing techniques of PCR 
5 mutagenesis, chemical mutagenesis (Drinkwater and Klinedinst, PNAS ^5:3402-3406, 
1986), by forced nucleotide mis incorporation (e.g., Liao and Wise Gene 88:\01-\\]^ 
1990), or by use of randomly mutagenized oligonucleotides (Horwitz et al.. Genome 
3A 12-117, 1989). Particularly preferred methods for constructing Alzheimer Disease 
Proteins are set forth in more detail below in the Examples. 

10 Proteins can be isolated by, among other methods, culturing suitable 

host and vector systems to produce the recombinant translation products of the present 
invention. Supematants from such cell lines, or protein inclusions or whole cells where 
the protein is not excreted into the supernatant, can then be treated by a variety of 
purification procedures in order to isolate the desired proteins. For example, the 

15 supernatant may be first concentrated using commercially available protein 
concentration filters, such as an Amicon or Millipore Pellicon ultrafiltration unit. 
Following concentration, the concentrate may be applied to a suitable purification 
matrix such as. for example, an anti -protein antibody bound to a suitable support. 
Alternatively, anion or cation exchange resins may be employed in order to purify the 

20 protein. As a further alternative, one or more reverse-phase high performance liquid 
chromatography (RP-HPLC) steps may be employed to further purify the protein. 
Other methods of isolating the proteins of the present invention are well known in the 
skill of the art. 

A protein is deemed to be "isolated** within the context of the present 
25 invention if no other (undesired) protein is detected pursuant to SDS-PAGE analysis 
followed by Coomassie blue staining. Within other embodiments, the desired protein 
can be isolated such that no other (undesired) protein is delected pursuant to SDS- 
PAGE analysis followed by silver staining. 

The present invention also provides for the manipulation and expression 
30 of the above described genes by culturing host cells containing a vector capable of 
. expressing the above-described genes. Such vectors or vector constructs include either 
synthetic or cDNA-derived nucleic acid molecules encoding AD4 proteins, which arc 
operabiy linked to suitable transcriptional or translational regulatory elements. Suitable 
regulatory elements may be derived from a variety of sources, including bacteriaL 
35 fungal, viral, mammalian, insect, or plant genes. Selection of appropriate regulatory 
elements is dependent on the host cell chosen, and may be readily accomplished by one 
of ordinary skill in the art. Examples of regulatory elements include: a transcriptional 
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promoter and enhancer or RNA polymerase binding sequence, a transcriptional 
terminator, and a ribosoma! binding sequence, including a translation initiation signal. 

Nucleic acid molecules that encode any of the AD4 proteins described 
above may be readily expressed by a wide variety of prokaryotic and cukaryotic host 
5 cells, including bacterial, mammaUan, yeast or other fiingi, viral, insect, or plant cells. 
Methods for transforming or transfecting such cells to express foreign DNA are well 
known in the art (see, e.g., Itakura et al., U.S. Patent No. 4,704,362; Hinnen ei al., Frac. 
Nail. Acad ScL USA 75:1929-1933, 1978; Murray el al., U.S. Patent No. 4,801.542; 
Upshall ct al., U.S. Patent No. 4,935,349; Ilagcn ci al., U.S. Patent No. 4,784,950; Axel 

10 etal., U.S. Patent No. 4,399,216; Goeddel eta!., U.S. Patent No. 4,766,075; and 
Sambrook et al. Molecular Cloning: A Laboratory ManuaL 2nd edition, Cold Spring 
Harbor Laboratory Press, 1989; for plant cells see Czako and Marton, Plant Physiol 
!04:\067-\01U 1994; and Paszkowski et al., Biotech. 2-/:387-392, 1992). 

Bacterial host cells suitable for carrying out the present invention 

15 include E. coli, B. subtilis. Salmonella typhimurium, and various species within the 
genus* Pseudomonas. Streptomyces, and Staphylococcus, as well as many other 
bacterial species well known to one of ordinary skill in the art. Representative 
examples of bacterial host cells include DH5a (Stratagene, LaJoIla, California). 

Bacterial expression vectors preferably comprise a promoter which 

20 functions in the host cell, one or more selectable phenotypic markers, and a bacterial 
origin of replication. Representative promoters include the p-laclamasc (penicillinase) 
and lactose promoter system (see Chang etal.. Nature 275:615, 1978), the T7 RNA 
polymerase promoter (Studier et al., Meth. Enzymol. /5J;60-89, 1990), the lambda 
promoter (Elvin et al.. Gene (97:123-126, 1990), the trp promoter (Nichols and 

25 Yanofsky, Meth. in Enzymology /(;/:! 55, 1983) and the tac promoter (Russell et al.. 
Gene 20: 231, 1982). Representative selectable markers include various antibiotic 
resistance markers such as the kanamycin or ampicillin resistance genes. Many 
plasmids suitable for transforming host cells are well known in the art, including among 
others, pBR322 (see Bolivar etal.. Gene 2:95, 1977), the pUC plasmids pUC18, 

30 pUC19, pUCllS, pUCn9 (see Messing, Meth in Enzymology 707:20-77, 1983 and 
Vieira and Messing, Gene 7^:259-268, 1982), and pNH8A, pNHI6a, pNHI8a, and 
Bluescript M13 (Stratagene, La Jolla, Calif.). 

Yeast and fungi host cells suitable for carrying out the present invention 
include, among others, Saccharomyces pombe, Saccharomyces cerevisiae^ the genera 

35 Pichia or Kluyveromyces and various species of the genus Aspergillus (McKnight et al., 
U.S. Patent No. 4,935,349). Suitable expression vectors for yeast and fungi include, 
among others, .YCp50 (ATCC No. 37419) for yeast, and the amdS cloning vector pV3 
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(Tumbull, Bio/Technology 7:169» 1989), YRp7 (Slruhl etal., Proc. NaiL Acad. Sci. 
USA 7d:l035-1039, 1978), YEpl3 (Broach etaK. Gene ^:121-133, 1979), pJDB249 
and pJDB219 (Beggs, Nature 275:104-108, 1978) and derivatives thereof 

Preferred promoters for use in yeast include promoters from yeast 
5 glycolytic genes (Hitzcman etal., J. Biol Chem. 255:12073-12080, 1980; Alber and 
Kawasaki, J. Mol. Appi GeneL 7:419-434, 1982) or alcohol dehydrogenase genes 
(Young et aL, in Genetic Engineering of Microorganisms for Chemicals, Ifollaender 
et al. (eds.), p. 355, Plenum. New York, 1982; Ammerer, Metk EnzymoL ;0/:192-201, 
1 983)- Examples of useful pronioiei-s for fungi vectors include those derived from 

10 Aspergillus nidulans glycolytic genes, such as the adhS promoter (McKnight et ah, 
EMBOl ^:2093-2099, 1985). The expression units may also include a transcriptional 
terminator. An example of a suitable terminator is the adh3 terminator (McKnight 
eiaK, f7>/V/., 1985). 

As with bacterial vectors, the yeast vectors will generally include a 

15 selectable marker, which may be one of any number of genes that exhibit a dominant 
phenotype for which a phenotypic assay exii^io-jsnable iransformants to be selected. 
Preferred selectable markers are those that complement host cell auxolrophy, provide 
antibiotic resistance or enable a cell to utilize specific carbon sources, and include leu2 
(Broach etal., ibid), uraJ (Botstein etal.. Gene 6f:17, 1979), or his3 (Slruhl ct ah, 

20 ibid). Another suitable selectable marker is the cat gene, which confers 
chloramphenicol resistance on yeast cells. 

Techniques for transforming fungi are well known in the literature, and 
have been described, for instance, by Beggs {ibid.), Hinnen et al. {Proc. Natl. Acad. Sci. 
USA 75:1929-1933, 1978), Yelton etal. (Proc. Nati Acad Sci. USA 8l:\7A0~\741, 

25 1984), and Russell {Nature 307:167-169, 1983). The genotype of ihe host cell may 
contain contain a genetic defect that is complemented by the selectable marker present 
on the expression vector. Choice of a particular host and selectable marker is well 
within the level of ordinary skill in the art. 

Protocols for the u^sfonnation of yeast are also well known to those of 

30 ordinary skill in the art. For example, transformation may be readily accomplished 
either by preparation of spheroplasts of yeast with DNA (see Hinnen et al., PNAS USA 
75:1929. 1978) or by treatment with alkaline salts such as LiCl (see Itoh etal., J. 
Bacteriology 753:163, 1983). Transformation of fungi may also be carried out using 
polyethylene glycol as described by Cullen et al. {Bio/Technology 5:369, 1987). 

35 Viral vectors include those which comprise a promoter that directs the 

expression of an isolated nucleic acid molecule that encodes an Alzheimer Disease 
Protein as described above. A wide variety of promoters may be utilized within the 
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context of the present invention, including for example, promoters such as MoMLV 
LTR, RSV LTR, Friend MuLV LTR, Adenoviral promoter (Ohno et al.. Science 265: 
781-784, 1994), neomycin phosphotransferase promoter/enhancer, late parvovirus 
promoter (Koering etal., Hum, Gene Therap. 5:457-463, 1994), Herpes TK promoter, 
5 SV40 promoter, meiallothionein Ila gene enhancer/promoter, cytomegalovirus 
immediate early promoter, and the cytomegalovirus immediate late promoter. Within 
particularly preferred embodiments of the invention, the promoter is a tissue-specific 
promoter (see e.g., WO 91/02805; EP 0,415,731; and WO 90/07936). Representative 
examples of suitable tissue specific promoters include neural specific enolase promoter, 

10 platelet derived growth factor beta promoter, bone morpho-gcneiic protein promoter, 
human alpha I -chimaerin. promoter, synapsin I promoter and synapsin II promoter. In 
addition to the above-noted promoters, other viral-specific promoters {e.g., retroviral 
promoters (including those noted above, as well as others such as HIV promoters), 
hepatitis, herpes {e.g., EBV), and bacterial, fungal or parasitic {e.g., malarial) -specific 

15 promoters may be utilized in order to target a specific cell or tissue which is infected 
with a virus, bacteria, fungus or parasite. 

Thus, AD4 proteins of the present invention may be expressed from a 
variety of viral vectors, including for example, herpes viral vectors {e.g., U.S. Patent 
No. 5,288,641), adenoviral vectors (e.g, WO 94/26914, WO 93/9191; KoIIs et al., 

20 />A'/f^P7{l):215-219, 1994; Kass-Eisler et al., Pf?(24);l 1498-502, 1993; Guzman 

el al.. Circulation 5<^(6):2838-48, 1993; Guzman el al., Cir. Res. 7J(6):1202-1207, 
1993; Zabner et al., CeU 7J(2):207-216, 1993; Li et al., Hum Gene Ther. 4(4):403-409, 
1993; Cailiaud et al., Eur. J. Neurosci. 5(10:1287-1291, 1993; Vincent et al., Nai. 
Genet. 5(2): 130-134, 1993; Jaffe et a!., Nat. Genet. /(5):372-378, 1992; and Levrero et 

25 al. Gene /0;(2): 195-202, 1991), adeno-associated viral vectors (WO 95/13365; Flotte 
et al., FNAS PO(22):I0613-10617, 1993), baculovirus vectors, parvovirus vectors 
(Koering et al., Hum, Gene Therap. 5:457-463, 1994), pox virus vectors (Panicali and 
Paoletli, PNAS 7^:4927-4931, 1982; and Ozaki et al., Biochem. Biophys. Res. Comm. 
yPi(2):653-660, 1993), and retroviruses (e.g.. EP 0,415,731; WO 90/07936; WO 

30 91/0285, WO 94/03622; WO 93/25698; WO 93/25234; U.S. Patem No. 5,219,740; WO 
93/1 1230; WO 93/10218. Viral vectors may likewise be constructed which contain a 
mixture of different elements (e.g., promoters, envelope sequences and the like) from 
different viruses, or non-viral sources. Within various embodiments, either the viral 
vector itself, or a viral particle which contains the viral vector may be utilized in the 

35 methods and compositions described below. 

Mammalian cells suitable for carrying out the present invention include, 
among others:. PC12 (ATCC No. CRL1721), NlE-n5 neuroblastoma, SK-N-BE(2)C 
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neuroblastoma, SHSY5 adrenergic neuroblastoma, NS20Y and NG108-15 murine 
cholinergic cell lines, or rat F2 dorsal root ganglion line, COS (e.g., ATCC No. CRL 
1650 or 1651), BHK (e.g,, ATCC No. CRL 6281; BHK 570 cell line (deposited with 
the American Type Culture Collection under accession number CRL 10314), CHO 
5 (ATCC No. CCL 61), HeLa (e.g.^ ATCC No. CCL 2), 293 (ATCC No. 1573; Graham 
et al., J. Gen. Virol. i6:59-72, 1977) and NS-I cells. Other mammalian cell lines may 
be used within the present invention, including Rat Hep I (ATCC No. CRL 1600), Rat 
Hep II (ATCC No. CRL 1548), TCMK (ATCC No, CCL 139), Human lung {ATCC 
No. CCL 75.1), Human hepatoma (ATCC No. HTn.52), Hep G2 (ATCC No. HB 

10 8065), Mouse liver (ATCC No. CCL 29.1), NCTC 1469 (ATCC No. CCL 9.1), SP2;0- 
Agl4 (ATCC No. 1581), H1T-T15 (ATCC No. CRL 1777), and RlNm 5AHT2B 
(Orskov and Nielson, FEES 229{ 1 ): 1 75- 1 78, 1 988). 

Mammalian expression vectors for use in carrying out the present 
invention will include a promoter capable of directing the transcription of a cloned gene 

15 or cDNA. Preferred promoters include viral promoters and cellular promoters. Viral 
promoters include the cytomegalovirus immediate early promoter (Bosharl et aL, Cell 
•/y:521-530, 1985)» cytomegalovirus immediate late promoter, SV40 promoter 
(Subramani el al., MoL Ceil. BioL 7:854-864, 1981), MMTV LTR, RSV LTR, 
metallothionein-1, adenovirus El a. Cellular promoters include the mouse 

20 metal lothionein-1 promoter (Palmiter etal., U.S. Patent No. 4,579,821), a mouse V,^ 
promoter (Bergman et ah, Proc. Natl Acad. ScL USA 5/:704l-7045, 1983; Grant etal., 
Nucl. Acid^ Res. 75:5496, 1987) and a mouse Vh promoter (Loh et al.. Cell iJ:85-93, 
1983). The choice of promoter will depend, at least in part, upon the level of 
expression desired or the recipient cell line to be iransfected. 

25 Such expression vectors may also contain a set of RNA splice sites 

located downstream from the promoter and upstream from the DNA sequence encoding 
the peptide or protein of interest. Preferred RNA splice sites may be obtained from 
adenovirus and/or immunoglobulin genes. Also contained in the expression vectors is a 
polyadenylation signal located downstream of the coding sequence of interest. Suitable 

30 polyadenylation signals include the early or late polyadenylation signals from SV40 
(Kaufman and Sharp, ibid), the polyadenylation signal from the Adenovirus 5 ElB 
region and the human growth hormone gene terminator (DeNoto et al., Nuc. Acids Res. 
9:3719-3730, 1981). The expression vectors may include a noncoding viral leader 
sequence, such as the Adenovirus 2 tripartite leader, located between the promoter and 

35 the RNA splice sites. Preferred vectors may also include enhancer sequences, such as 
the SV40 enhancer and the mouse Ig heavy chain enhancer (Gillies, Cell ii:71 7-728. 
1983). Expression vectors may also include sequences encoding the adenovirus VA 
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RNAs. Suitable expression vectors can be obtained from commercial sources {e.g., 
Stratagcne, La Jolla, Calif.). 

Vector constructs comprising cloned DNA sequences can be introduced 
into cultured mammalian cells by, for example, calcium phosphate-mediated 
5 transfection (Wigler et al.. Cell 14:125, 1978; Corsaro and Pearson, Somatic Cell 
Genetics 7:603, 1981; Graham and Van derEb, Virology 52 ASd, 1973), eiectroporation 
(Neumann ctaJ., EMBO J. 7:841-845, 1982), or DEAE-dextran mediated transfection 
(Ausubel et al. (eds.), Current Protocols in Molecular Biology, John Wiley and Sons, 
Inc., NY. 1987). To identify cells that have stably integrated the cloned DNA. a 

10 selectable marker is generally introduced into the cells along with the gene or cDNA of 
interest. Preferred selectable markers for use in cultured mammalian cells include 
genes that confer resistance to drugs, such as neomycin, hygromycin, and methoUexatc. 
The selectable marker may be an ampiifiable selectable marker. Preferred amplifiable 
selectable markers are the DHFR gene and the neomycin resistance gene. Selectable 

1 5 markers are reviewed by Thilly {Mammalian Cell Technolog}\ Butterwonh Publishers, 
Stoneham, MA, which is incorporated herein by reference). 

Mammalian cells containing a suitable vector arc allowed lo grow for a 
period of time, typically 1-2 days, to begin expressing the DNA sequence(s) of interest. 
Drug selection is then applied to select for growth of cells that are expressing the 

20 selectable marker in a stable fashion. For cells that have been transfected with an 
amplifiable, selectable marker the drug concentration may be increased in a stepwise 
manner to select for increased copy number of the cloned sequences, thereby increasing 
expression levels. Cells-expressing the introduced sequences are selected and screened 
for production of the protein of interest in the desired form or at the desired level. Cells 

25 that satisfy these criteria can then he cloned and scaled up for production. 

Protocols for the transfection of mammalian cells are well known lo 
those of ordinary skill in the art. Representative methods include calcium phosphate 
mediated transfection, eiectroporation, lipofection, retroviral, adenoviral and protoplast 
fusion-mediated transfection {see Sambrook et al,, supra), 

30 Numerous insect host cells known in the art can also be useful within the 

present invention, in light of the subject specification. For example, the use of 
baculovtruses as vectors for expressing heterologous DNA sequences in insect cells has 
been reviewed by Atkinson et al. {PesUc, Set. 28:2 1 5-224, 1 990). 

Numerous plant host cells known in the art can also be useful within the 

35 present invention, in light of the subject specification. For example, the use of 
Agrobacterium rhizogenes as vectors for expressing genes in plant cells has been 
reviewed by Sinkar et al., {J. BioscL (Bangalore) 7 7:47-58, 1987). 
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AD4 proteins may be prepared by growing (typically by culluring) the 
host/vector systems described above, in order to express the recombinant Alzheimer 
Disease Proteins. Recombinanily produced AD4 proteins may be further purified as 
described in more detail below. 
5 Within related aspects of the present invention, ADA proteins may be 

expressed in a transgenic animal whose germ cells and somatic cells contain an AD4 
gene which is operably linked to a promoter effective for the expression of the gene 
may also be expressed in non-human transgenic animals such as mice, rats, rabbits, 
sheep, dogs and pigs (sec Hanuner elal. Nature 5/5:680-683, 1985, Palmiter et ah 

10 Science 272:809-814, 1983. Brinstcr ctal. Proc. Natl. Acad Set. USA <?2:443 8-4442. 
1985, Palmiter and Brinsler Cell ^7:343-345, 1985 and U.S. Patent Nos. 5,175.383, 
5,087,571, 4J36,866, 5,387,742, 5,347,075, 5,221,778, and 5,175,384). Briefly, an 
expression vector, including a nucleic acid molecule to be expressed together with 
appropriately positioned expression control sequences, is introduced into pronuclei of 

15 fertilized eggs, for example, by microinjection. Integration of the injected DNA is 
delected by blot analysis of DNA from tissue samples. It is preferred that the 
introduced DNA be incorporated into the germ line of the animal so that it is passed on 
to the animal's progeny. Tissue-specific expression may be achieved through the use of 
a tissue-specific promoter, or through the use of an inducible promoter, such as the 

20 metallolhionein gene promoter (Palmiter elal., 1983, ibid), which allows regulated 
expression of the iransgene. 

Vectors of the present invention may contain or express a wide variety 
of additional nucleic acid molecules in place of or in addition to an AD4 protein as 
described above, either from one or several separate promoters. For example, the viral 

25 vector may express a lymphokine or lymphokine receptor, antisense or ribozymc 
sequence or toxins. Representative examples of lymphokines include lL-1, lL-2, IL-3. 
IL-4, IL-5, IL-6, iL-7, IL-S, IL-9, IL-10, IL-) I, IL-12, IL-13, IL-14, IL-15, GM-CSF, 
G-CSF, M-CSF, alpha-inlerferon, beta-interferon, gamma interferon, and tumor 
necrosis factors, as well as their respective receptors. Representative examples of 

30 antisense sequences include antisense sequences which block the expression of AD4 
protein mutants. Representative examples of toxins include: ricin, abrin, diphtheria 
toxin, cholera toxin, saporin, gelonin, pokeweed antiviral protein, Iritin, Shigella toxin, 
and Pseudomonas exotoxin A. 

Within other aspects of the invention, antisense oligonucleotide 

35 molecules are provided which specifically inhibit expression of mutant AD4 nucleic 
acid sequences {see generally, Hirashima et al. in Molecular Biology of RNA: New 
Perspectives (M. Inouye and B. S. Dudock, eds., 1987 Academic Press, San Diego, p. 
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401); Oligonucleotides: Antisense Inhibitors of Gene Expression (j.S. Cohen, ed., 1989 
MacMillan Press, London); Stein and Cheng, Science 2^7:1004-1012 (1993); WO 
95/10607; U.S. 5,359,051; WO 92/06693; and EP-A2-6 12844). Briefly, such 
molecules are constructed such that they are complementary to, and able to form 
5 Watson-Crick base pairs with, a region of transcribed AD4 mutant mllNA sequence 
containing an AD4 mutation. The resultant double-stranded nucleic acid interferes with 
subsequent processing of the mRNA, thereby preventing protein synthesis. 

Within other related aspects of the invention, ribozyme molecules arc 
provided wherein an antisense oligonucleotide sequence is incorporated into a ribozyme 

10 which can specifically cleave mRNA molecules transcribed from a mutant AD4 gene 
(see generally, Kim el al. Proc. Nat. Acad. ScL USA ^^:8788 (1987); Haseloff, et al. 
Nature 234:5^5 (1988), Cech, JAMA 260:3030 (1988); Jeffries, et al. Nucleic Acids 
Res. 7 7:1371 (1989); U.S. 5,093,246; U.S. 5,354,855; U.S. 5,144,019; U.S. 5,272,262; 
U.S. 5,254.678; and U.S. 4,987,071). According to this aspect of the invention, the 

15 antisense sequence which is incorporated into a ribozyme includes a sequence 
complementary to, and able to form Watson-Crick base pairs with, a region of the 
transcribed mutant AD4 mRNA containing an AD4 mutation. The antisense sequence 
thus becomes a targeting agent for delivery of catalytic ribozyme activity specifically to 
mutant AD4 mRNA, where such catalytic activity cleaves the mRNA to render it 

20 incapable of being subsequently processed for AD4 protein translation. 



Host Cells 

25 As discussed above, nucleic acid molecules which encode the Ab4 

proteins of the present invention (or the vectors which contain and/or express related 
mutants) may readily be introduced into a wide variety of host cells. Representative 
examples of such host cells include plant cells, eukaryotic cells, and prokaryotic cells. 
Within preferred embodiments, the nucleic acid molecules are introduced into cells 

30 from a vertebrate or warm-blooded animal, such as a human, macaque, dog, cow. horse, 
pig, sheep, rat, hamster, mouse or fish cell, or any hybrid thereof 

Preferred prokaryotic host cells for use within the present invention 
include E. coli. Salmonella, Bacillus, Shigella. Pseudomonas. Strepiomyces and other 
genera. Techniques for transforming these hosts and expressing foreign DNA 

35 sequences cloned therein are well known in the art (sec, e.g.^ Maniatis et al.. Molecular 
Cloning: A Laboratory Manual^ Cold Spring Harbor Laboratory, 1982, which is 
incorporated herein by reference; or Sambrook et aL, supra). Vectors used for 
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expressing cloned DNA sequences in bacterial hosts will generally conlain a selectabic 
marker, such as a gene for antibiotic resistance, and a promoter that functions in the 
host cell. Appropriate promoters include the trp (Nichols and Yanofsky, h4eth. 
Enzymol }01:\SSA(A. 1983), lac (Casadaban etal., J. Bacterial. 7^5:971-980, 1980), 
5 and phage k (Queen, ./. MoL Appl. Genet. 2:1-10, 1983) promoter systems. Piasmids 
useful for transforming bacteria include the pUC piasmids (Messing, Meth, Enzymol. 
/(;7:20-78, 1983; Vieira and Messing, Gene /P:259-268, 1982), pBR3 22 (Bolivar et al., 
Gene 2:95-1 13, 1977), pCQV2 (Queen, ibid,\ and derivatives thereof. Piasmids may 
contain both viral and bacterial elements. 

10 Preferred cukaryotic cells include cultured mammalian cell lines {e.g.. 

rodent or human cell lines) and fungal cells, including species of yeast {e,g., 
Saccharomyces spp., particularly S. cerevisiae, Schizosaccharomyccs spp., or 
Kluyveromyces spp.) or filamentous fungi (e.g,^ Aspergillus spp., Neurospora spp.). 
Strains of the ycasl Saccharomyces cercvisiae are particularly preferred. Methods for 

1 5 producing recombinant proteins in a variety of prokaryotic and eukaryotic host cells are 
generally known in the art (see, "Gene Expression Technology," Methods in 
Enzymolog}\ Vol. 185, Goeddel (ed.). Academic Press. San Diego, Calif. 1990; see 
also. "Guide to Yeast Genetics and Molecular Biology," Methods in EnzymologyK 
Guthrie and Fink (cds.). Academic Press, San Diego, Calif., 1991). In general, a host 

20 cell will be selected on the basis of its ability to produce the protein of interest at a high 
level or its ability to carry out at least some of the processing steps necessary for the 
biological activity of the protein. In this way, the number of cloned DNA sequences 
that must be introduced into the host cell can be minimized and overall yield of 
biologically active protein can be maximized. 

25 The nucleic acid molecules (or vectors) may be introduced into host 

cells by a wide variety of mechanisms, including for example calcium phosphate- 
mediated transfection (Wigier et al.. Cell 14:115, 1978), lipofection; gene gun (Corsaro 
and Pearson, Somatic Cell Gen. 7:603, 1981; Graham and Van der Eb, Virology 52:456, 
1973), elecu-oporation (Neumann et al., EMBO 1 7:841-845, 1982), retroviral, 

30 adenoviral, protoplast fusion-mediated transfection or DEAE-dextran mediated 
transfection (Ausubel et al., (eds.). Current Protocols in Molecular Biology, John 
Wiley and Sons, Inc., NY, NY, 1987). 

Host cells containing vector constructs of the present invention are then 
cultured to express a DNA molecule as described above. The cells are cultured 

35 according to standard methods in a culture medium containing nutrients required for 
growth of the chosen host cells. A variety of suitable media are known in the iart and 
generally include a carbon source, a nitrogen source, essential amino acids, vitamins 
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and minerals, as well as other components, e.^., growth factors or serum, that may be 
required by the particular host cells. The growth medium will generally select for cells 
containing the DNA construcl(s) by, for example, drug selection or deficiency in an 
essential nutrient which is complemented by the selectable marker on the DNA 
5 construct or co-lransfccied with the DNA construct. 

Suitable growth conditions for yeast cells, for example, include culturing 
in a chemically defined medium, comprising a nitrogen source, which may be a non- 
amino acid nitrogen source or a yeast extract, inorganic salts, vitamins and essential 
amino acid supplements at a temperature between 4°C and 37°C, with 30**C being 

10 particularly preferred. The pH of the medium is preferably maintained at a pH greater 
than 2 and less than 8. more preferably pH 5-6. Methods for maintaining a stable pH 
include buffering and constant pH control. Preferred agents for pH control include 
sodium hydroxide. Preferred buffering agents include succinic acid and Bis-Tris 
(Sigma Chemical Co., St, Louis, Mo.). Due to the tendency of yeasi host cells to 

1 5 hyperglycosylale heterologous proteins, it may be preferable to express the nucleic acid 
molecules of the present invention in yeast cells having a defect in a gene required for 
asparagine-l inked glycosylation. Such cells are preferably grown in a medium 
containing an osmotic stabilizer. A preferred osmotic stabilizer is sorbitol 
supplemented into the medium at a concentration between 0.1 M and 1.5 M, preferably 

20 at 0.5 M or 1.0 M. 

Cultured mammalian cells are generally cultured in commercially 
available serum-containing or serum-free media. Selection of a medium and growth 
conditions appropriate for the particular cell line used is well within the level of 
ordinary skill in the art. 

25 

Antibodies 

Antibodies to the AD4 proteins discussed above may readily be prepeu-ed 
given the disclosure provided herein. Such antibodies may, within certain 
embodiments, specifically recognize wild-type AD4 protein rather than mutant AD4 

30 protein, mutant AD4 protein rather than wild-type AD4 protein, or equally recognize 
both wild-type and mutant AD4 protein. Within the context of the present invention, 
antibodies are understood to include monoclonal antibodies, polyclonal antibodies, anii- 
idiotypic antibodies, antibody fragments (e.j?.. Fab, and F(ab*)2, Fy variable regions, or 
complementarity determining regions). As discussed above, antibodies are understood 

35 to be specific against an AD4 protein if it binds with a Ka of greater than or equal to 
I0~^M, preferably greater than of equal to 10"^M. The affinity of a monoclonal 
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antibody or binding partner can be readily determined by one of ordinary skill in the art 
(see Scatchard, Ann, N. K Acad Sci. 57:660-672, 1949). 

Briefly, polyclonal antibodies may be readily generated by one of 
ordinary skill in the art from a variety of warm-blooded animals such as horses, cows, 
5 various fowl, rabbits, mice, or rats. Typically, an AD4 protein or unique AD4 peptide 
of 13-20 amino acids (preferably conjugated to keyhole limpet hemocyanin by cross- 
linking with glutaraldehyde) is utilized to immunize the animal through intraperitoneal, 
intramuscular, intraocular, or subcutaneous injections, an adjuvant such as Freund's 
complete or incomplete adjuvant. Following several booster immunizations, samples 

10 of serum are collected and tested for reactivity to the AD4 protein. Particularly 
preferred polyclonal antisera will give a signal on one of these assays that is at least 
three times greater than background. Once the titer of the animal has reached a plateau 
in terms of its reactivity to the protein, larger quantities of antisera may be readily 
obtained either by weekly bleedings, or by exsanguinating the animal. 

1 5 Monoclonal antibodies may also be readily generated using conventional 

techniques {see U.S. Patent Nos. RE 32,011, 4,902,614, 4,543,439, and 4,411,993 
which are incorporated herein by reference; see also Monoclonal Antibodies, 
Hybridomas: A New Dimension in Biological Analyses, Plenum Press, Kennett, 
McKeam, and Bechtol (eds.), 1980, and Antibodies: A Laboratory ManuaL Harlow and 

20 Lane (eds ). Cold Spring Harbor Laboratory Press, 1988. which are also incorporated 
herein by reference). 

Briefly, within one embodiment a subject animal such as a rat or mouse 
is injected with an AD4 protein or portion thereof as described above. The protein may 
be admixed with an adjuvant such as Freund's complete or incomplete adjuvant in order 

25 to increase the resultant immune response. Between one and three weeks after the 
initial immunization the animal may be reimmunized with another booster 
immunization, and tested for reactivity to the protein utilizing assays described above. 
Once the animal has reached a plateau in its reactivity to the injected protein, it is 
sacrificed, and organs which contain large numbers of B cells such as the spleen and 

30 lymph nodes are harvested. 

Cells which are obtained from the immunized animal may be 
immortalized by transfection with a virus such as the Epstein-Barr virus (EBV) (see 
Glasky and Reading, Hybridoma 8(4):271'3^9, 1989). Alternatively, within a preferred 
embodiment, the harvested spleen and/or lymph node cell suspensions are fused with a 

35 suitable myeloma cell in order to create a "hybridoma" which secretes monoclonal 
antibody. Suitable myeloma lines include, for example, NS-1 (ATCC No. TIB 18), and 
P3X63 - Ag 8.653 (ATCC No. CRL 1580). 
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Following the fusion, the cells may be placed into culture plates 
containing a suitable medium, such as RPMI 1640, or DMEM (Dulbecco's Modified 
Eagles Medium) (JRH Biosciences, Lenexa, Kansas), as well as additional ingredients, 
such as Fetal Bovine Serum (PBS, /.c, from Hyclone, Logan, Utah, or JRH 
5 Biosciences). Additionally, the medium should contain a reagent which selectively 
allows for the growth of fused spleen and myeloma cells such as HAT (hypoxanthine, 
aminopterin, and thymidine) (Sigma Chemical Co., St. Louis, Missouri). After about 
seven days, the resulting fused cells or hybridomas may be screened in order to 
deicmiinc the presence of antibodies which arc reactive against an AD4 protein. A 

10 wide variety of assays may be utilized to determine the presence of antibodies which 
arc reactive against the proteins of the present invention, including for example 
countercurrenl immuno-electrophoresis, radioimmunoassays, radioimmunoprecipita- 
tions, enzyme-linked immuno-sorbcnt assays (ELISA), dot blot assays. Western blots, 
immunoprecipitation, inhibition or competition assays, and sandwich assays {sec U.S. 

15 Patent Nos. 4,376,110 and 4,486,530; see also Antibodies: A Laboratory ManuaL 
Harlow and Lane (eds.). Cold Spring Harbor Laboratory Press, 1988). Following 
several clonal dilutions and reassays, a hybridoma producing antibodies reactive against 
the AD4 protein may be isolated. 

Other techniques may also be utilized to construct monoclonal 

20 antibodies {see William D. Huse etal., "Generation of a Large Combinational Library 
of the Immunoglobulin Repertoire in Phage Lambda." Science 246:1275-1281, 
December 1989; see also L. Sastry et al., "Cloning of the Immunological Repertoire in 
Escherichia coli for Generation of Monoclonal Catalytic Antibodies: Construction of a 
Heavy Chain Variable Region-Specific cDNA Library," Proc. Nad. Acad. Sci. USA 

25 ^(5/5728-5732, August 1989; see also Michelle Alting-Mecs et al.. "Monoclonal 
Antibody Expression Libraries: A^ Rapid Alternative to Hybridomas," Strategies in 
Molecular Biology i:l-9, January 1990; these references describe a commercial system 
available from Stratacyte, La JoUa, California, which enables the production of 
antibodies through recombinant techniques). Briefly, mRNA is isolated from a B cell 

30 population, and utilized to create heavy and light chain immunoglobulin cDNA 
expression libraries in the MmmunoZap(H) and A.lmmunoZap(L) vectors. These 
vectors may be screened individually or co-expressed to form Fab fragments or 
antibodies {see Huse el al., supra; sec also Sastry et al., supra). Positive plaques may 
subsequently be converted to a non-lylic plasmid which allows high level expression of 

35 monoclonal antibody fragments from E, coli. 

Similarly, portions or fragments, such as Fab or Fv fragments, of 
antibodies may also be constructed utilizing conventional enz^'malic digestion or 
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recombinant DNA techniques to incorporate the variable regions of a gene which 
encodes a specifically binding antibody. Within one embodiment, the genes which 
encode the variable region from a hybridoma producing a monoclonal antibody of 
interest are amplified using nucleotide primers for the variable region. These primers 
5 may be synthesized by one of ordinary skill in the art, or may be purchased from 
commercially available sources. Stratacyte (La Jolla, Calif ) sells primers for mouse 
and human variable regions including, among others, primers for V^g, Vj^^* Vj.,c, V^dt 
^Hl' regions. These primers may be utilized to amplify heavy or light chain 

variable regions, which may then be inserted iiuo vectors sucli as InmmnoZAP*'^" H or 

10 ImmunoZAP^ L (Stratacyte), respectively. These vectors may then be introduced into 
E. coll, yeast, or mammalian-based systems for expression. Utilizing these techniques, 
large amounts of a single-chain protein containing a fusion of the and Vl domains 
may be produced (see Bird et aL, Science 242:423-426, 1988). In addition, such 
techniques may be utilized to change a "murine" antibody to a "human" antibody, 

15 without altering the binding specificity of the antibody. 

Once suitable antibodies have been obtained, they may be isolated or 
purified by many techniques well known to those of ordinary skill in the art (see 
Antihodies: A Laboratory ManuaL Harlow and Lane (cds.), Cold Spring Harbor 
Laboratory Press, 1988). Suitable techniques include peptide or protein affmity 

20 columns, HPLC or RP-HPLC, purification on protein A or protein G columns, or any 
combination of these techniques. 

Antibodies of the present invention have many uses. For example, 
antibodies may be utilized in flow cytometry to sort cells bearing such an AD4 protein. 
Briefly, in order to detect the protein or peptide of interest on cells, the cells arc 

25 incubated with a labeled monoclonal antibody which specifically binds to the protein of 
interest, followed by detection of the presence of bound antibody. These steps may also 
be accomplished with additional steps such as washings to remove unbound antibody. 
Labels suitable for use within the present invention are well known in the art including, 
among others, flourescein isothiocyanate (FITC), phycoerythrin (PE), horse radish 

30 peroxidase (HRP), and colloidal gold. Particularly preferred for use in flow cytometr>'. 
is FITC, which may be conjugated to purified antibody according to the method of 
Keltkamp in "Conjugation of Fluorescein Isothiocyanate to Antibodies. I. Experiments 
on the Conditions of Conjugation,'* Immunology 75:865-873, 1970. {See also 
Keltkamp. "Conjugation of Fluorescein Isothiocyanate to Antibodies. ILA 

35 Reproducible Method," Immunology 75:875-881. 1970; Coding, "Conjugation of 
Antibodies with Fluorochromes: Modification to the Standard Methods," J. Immunol. 
Methods 75:21 5-226, 1 970.). 
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Assays 

Assays useful within the context of the present invention include those 
assays for delecting agonists or antagonists of AD4 protein activity. Other assays are 
5 useful for the screening of peptide or organic molecule libraries. Still other assays are 
useful for the identification and/or isolation of nucleic acid molecules and/or peptides 
within the present invention, or for diagnosis of a patient with an increased likelihood 
of contracting Alzheimer's Disease. 

10 A. Nucleic Acid Based Diagnostic Tests 

Briefly, another aspect of the present invention provides probes and 
primers for detecting the AD4 genes and/or mutants thereof. In one embodiment of this 
aspect, probes are provided that are capable of specifically hybridizing to RNA or DNA 
from the AD4 gene. For purposes of the present invention, probes are "capable of 

15 hybridizing" to AD4 genes DNA or RNA if they hybridize to an AD4 gene under 
conditions of either high or moderate stringency (.vec Sambrook et aL. supra) but not 
significantly or deteciably to the AD3 gene. Preferably, the probe may be utilized to 
hybridize to suitable nucleotide sequences under high stringency conditions, such as 5x 
SSPE, Ix Denhardl's solution (Sambrook et aL, it/pra), 0.1% SDS at 65°C and at least 

20 one wash to remove excess probe in the presence of 0.2x SSC, 1 x Denhardt's solution, 
0.1% SDS at 65**C. Except as otherwise provided herein, probe sequences are designed 
to allow hybridization to AD4 genes, but not to DNA or RNA sequences from other 
genes, such as AD3. The probes are used, for example, lo hybridize to nucleic acid that 
is present in a biological sample isolated from a patient. 1 he hybridized probe is then 

25 delected, thereby indicating the presence of the desired cellular nucleic acid. 
Preferably, the cellular nucleic acid is subjected lo an amplification procedure, such as 
PCR, prior to hybridization. Alternatively, the AD4 gene may be amplified and the 
amplified product subjected to DNA sequencing. Mutants of AD4 may be detected by 
DNA sequence analysis or hybridization with allele-specific oligonucleotide probes 

30 under conditions and for time sufficient to allow hybridization to the specific allele. 
Typically, the hybridization buffer and wash will contain letramethyl anrmionium 
chloride or the like {see Sambrook et al., supra). 

Nucleic acid probes of the present invention may be composed of either 
deoxyribonucleic acids (DNA), ribonucleic acids (RNA), nucleic acid analogues {eg., 

35 peptide nucleic acids), or any combination thereof, and may be as few as about 12 
nucleotides in length, usually about 1 4 to 18 nucleotides in length, and possibly as large 
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as the entire sequence of an AD4 gene. Selection of probe size is somewhat dependent 
upon the use of the probe, and is within the skill of the art. 

Suitable probes can be constructed and labeled using techniques that are 
well known in the art. Shorter probes of, for example, 12 bases can be generated 
5 synthetically and labeled with ^^P using T4 polynucleotide kinase. Longer probes of 
about 75 bases to less than L5 kb are preferably generated by, for example, PGR 
amplification in the presence of labeled precursors such as la'^^?]dCJ?, digoxigenin- 
dUTP, or biotin-dATP. Probes of more than K5 kb are generally most easily amplified 
by transfecting a cell with a plasmid containing the relevant probe, growing the 

10 transfected cell into large quantities, and purifying the relevant sequence from the 
transfected cells. (See Sambrook et al., supra.) 

Probes can be labeled by a variety of markers, including for example, 
radioactive markers, fluorescent markers, enzymatic markers, and chromogenic 
markers. The use of ^-P is particularly preferred for marking or labeling a particular 

1 5 probe. 

It is a feature of this aspect of the invention that the probes can be 
utilized to detect the presence of AD4 mRNA or DNA within a sample. However, if 
the nucleic acid is present in only a limited number, then it may be beneficial to amplifS' 
the relevant sequence so that it may be more readily detected or obtained. 

20 A variety of methods may be utilized in order to amplify a selected 

sequence, including, for example, RNA amplification (see Lizardi el al., 
Bio/Technology d: 1 197-1202, 1988; Kramer et al.. Nature iiP:401-402, 1989; Lomcli 
etal., Clinical Chem. 55(9): 1826- 1 831, 1989; U.S. Patent No. 4,786,600), and DNA 
amplification utilizing LCR or Polymerase Chain Reaction ("PGR") (see. U.S. Patent 

25 Nos. 4,683,195, 4,683,202, and 4,800,159) (see also U.S. Patent Nos. 4,876.187 and 
5,01 1,769, which describe an alternative detection/amplification system comprising the 
use of scissile linkages), or other nucleic acid amplification procedures that are well 
within the level of ordinary skill in the art. With respect to PGR. for example, the 
method may be modified as known in the art. Transcriptional enhancement of PGR 

30 may be accomplished by incorporation of bacteriophage T7 RNA polymerase promoter 
sequences in one of the primary oligonucleotides, and immunoenzymatic detection of 
the products from the enhanced emitter may be effected using anti-RNA:DNA 
antibodies (Blais, Appi Environ. Microbiol 60:348-352, 1994). PGR may also be used 
in combination v^rith reverse dot-blot hybridization (lida et al., FEMS Microbiol. Leti 

35 1 7^:167-172, 1993). PGR products may be quantitatively analyzed by incorporation of 
dUTP (Duplaaetal., AnaL Biochem. 2/2:229-236, 1993), and samples may be filter 



BNSOOCID: <WO 970319ZA3. IA> 



wo 97/03192 PCT/IIS96/! 1386 

28 



sampled for PCR-gene probe detection (BejetaL, Appl. Environ. Microbioi 
57:3529-3534, 1991). 

Within a particularly prefcncd embodiment, PGR amplification is 
utilized to detect AD4 DNA. Briefly, as described in greater detail below, a DNA 
5 sample is denatured at 95°C in order to generate single-stranded DNA. Specific 
primers are then amicaled to the single-stranded DNA at 37*'C to 70°C. depending on 
the proportion of AT/GC in the primers. The primers are extended at 72**C with Taq 
DNA polymerase in order to generate the opposite strand to the template. These steps 
constitute one cycle, which may be repeated in order to amplify the selected sequence. 

10 Within an alternative preferred embodiment, LCR amplification is 

utilized for amplification. LCR primers arc synthesized such that the 5' base of the 
upstream primer is capable of hybridizing to a unique base pair in a desired gene to 
specifically detect an AD4 gene. 

Within another preferred embodiment, the probes arc used in an 

1 5 automated, non-isotopic strategy wherein.largct nucleic acid sequences arc amplified by 
PGR. and then desired products are determined by a coiori metric oligonucleotide 
ligation assay (OLA) (Nickerson et al., Proc. NatL Acad Set. USA 57:8923-8927, 
1990). 

Primers for the amplification of a selected sequence should be selected 

20 from sequences that are highly specific to ADA (an not AD3) and form stable duplexes 
with the target sequence. The primers should also be non-complementarA', especially at 
the 3' end, should not form dimers with themselves or other primers, and should not 
form secondary structures or duplexes with other regions of DNA. In general, primers 
of about 18 to 20 nucleotides are preferred, and can be easily synthesized using 

25 techniques well known in the art. PGR products, and other nucleic acid amplification 
products, may be quantitated using techniques known in the art (Duplaa et al., AnaL 
Biochem. 2/2:229-236, 1993; Higuchi et al., Bio/Technology 7/: 1026-1030). 

Within a particularly preferred embodiment, restriction enzyme analysis 
is used lo detect a mutant AD4 gene. For example, within one embodiment the N14 1 1 

30 mutation in AIM creates a Sau3A I restriction site. cDNA, genomic DNA, or amplified 
AD4 cDNA or genomic DNA is restricted with Sau3A I. If the A D4 gene contains the 
N141I alteration, the5fl«3A I fragment or amplified product containing codon 141 will 
be cleaved into two smaller fragments. The cleavage is readily detected by 
electrophoresis in agarose or aery I amide gels and visualized by ethidium bromide 

35 fluorescence or probe hybridization using AD4 gene fragments or oligonucleotides. 
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B. Antibodv-bascd Diagnostic Tests 

Still another aspect of the present invention provides antibodies, as 
discussed above, for the detection of AD4 gene products in diagnostic tests. A variety 
of assays can be utilized in order lo detect antibodies that specifically bind to the 
5 desired protein or peptide. Exemplary assays are described in detail in Antibodies: A 
Laboratory Manual, Harlou' and Lane (eds.). Cold Spring Harbor Laboratoo' Press, 
1988. Representative examples of such assays include: countercurrent Immuno- 
electrophoresis (CIEP), radioimmunoassays, radioimmunoprccipilations, enzyme- 
linked immuno-sorbent assays (ELISAX dot biol assays, inhibition or competition 

10 assays, and sandwich assays, immunostick (dipstick) assays, simultaneous 
immunoassays, immunochromatographic assays, immuno filtration assays, latex bead 
agglutination assays, immunofluorescent assays, biosensor assays, and low-light 
detection assays {see U.S. Patent Nos. 4,376, 110 and 4,486.530: see also Antibodies: A 
Laboratory ManuaL supra). 

15 A fluorescent antibody test (FA-test) uses a fluorescent I y- labeled 

antibody able to bind to one of the proteins of the invention. For detection, visual 
determinations are made by a technician using fluorescence microscopy, yielding a 
qualitative result. In a preferred embodiment, this assay is used for the examination of 
tissue samples and histological sections. 

20 In latex bead agglutination assays, antibodies to one or more of the 

proteins of the present invention are conjugated to latex beads. The antibodies 
conjugated lo the latex beads are then contacted with a sample under conditions 
permining antibodies lo bind to desired proteins in the sample, if any. The results are 
then read visually, yielding a qualitative result. In a preferred embodiment, this format 

25 can be used in the field for on-site testing. 

Enzyme inununoassays (EIA) include a number of different assays able 
to utilize the antibodies provided by the present invention. For example, a 
heterogeneous indirect EIA uses a solid phase coupled with an antibody of the 
invention and an affinity purified, anti-IgG immunoglobulin preparation. Preferably, 

30 the solid phase is a polystyrene microti ter plate. The antibodies and immimo globulin 
preparation are then contacted with the sample under conditions permitting antibody 
binding, which conditions are well known in the art. The results of such an assay can 
be read visually, but are preferably read using a spectrophotometer, such as an ELISA 
plate reader, to yield a quantitative result. 

35 An alternative solid phase EIA format includes a plastic-coated ferrous 

metal beads able to be moved during the procedures of the assay by means of a magnet. 
Yet another alternative is a low-light detection immunoassay format. In this highly 
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sensitive formal, the light emission produced by appropriately labeied bound antibodies 
are quanlitated automatically. Preferably, the reaction is performed using microliter 
plates. 

in a capture-antibody sandwich enzyme assay, the desired protein is 
5 bound between an antibody attached to a solid phase, preferably a polystyrene 
microliter plate, and a labeled antibody. Preferably, the results are measured using a 
speclropholometer, such as an ELISA plate reader. 

In an alternative embodiment, a radioactive tracer is substituted for the 
enzyme mediated detection in m EIA to produce a radioimmunoassay (RIA). 

In a sequential assay format, reagents are allowed to incubate with the 
capture antibody in a step wise fashion. The test sample is first incubated with the 
capture antibody. Following a wash step, an incubation with the labeled antibody 
occurs, in a simultaneous assay, tlie two incubation periods described in the sequential 
assay arc combined. This eliminates one incubation period plus a wash step. 
15 A dipstickyimmunostick format is essentially an immunoassay except 

that the solid phase, instead of being a polystyrene microliter plate, is a polystyrene 
paddle or dipstick. Reagents are the same and the format can either be simultaneous or 
sequential. 

In a chromatographic strip lest format, a capture antibody and a labeled 

20 antibody are dried onto a chromatographic strip, which is typically nitrocellulose or 
nylon of high porosity bonded to cellulose acetate. The capture antibody is usually 
spray dried as a line at one end of the strip. At this end there is an absorbent material 
that is in contact with the strip. At the other end of the strip the labeled antibody is 
deposited in a manner that prevents it from being absorbed into the membrane. 

25 Usually, the label attached to the antibody is a latex bead or colloidal gold. The assay 
may be initialed by applying the sample immediately in front of the labeled antibody. 

Immunofiltration/immunoconcentration formats combine a large solid 
phase surface with directional flow of sample/reagents, which concentrates and 
accelerates the binding of antigen to antibody. In a preferred format, the test sample is 

30 preincubated with a labeled antibody then applied to a solid phase such as fiber fillers 
or nitrocellulose membranes or the like. The solid phase can also be precoated with 
latex or glass beads coated with capture antibody. Detection of anaiyte is the same as 
standard immunoassay. The flow of sample/reagents can be modulated by either 
vacuum or the wicking action of an underlying absorbent material. 

33 A threshold biosensor assay is a sensitive, instrumented assay amenable 

to screening large number of samples at low cost. In one embodiment, such an assay 
comprises the use of light addressable polentiometric sensors wherein the reaction 
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involves the detection of a pH change due to binding of the desired protein by capture 
antibodies, bridging antibodies and urease-conjugated antibodies. Upon binding, a pH 
change is effected that is measurable by translation into electrical potential (^ivolts). 
The assay typically occurs in a ver\' small reaction volume, and is very sensitive. 
5 Moreover, the reported detection limit of the assay is 1 ,000 molecules of urease per 
minute. 

C. Other Assays 

Transmembrane receptors are involved in many cellular communication 

10 process and have been the targets of numerous pharmacologic screening assays for the 
identification and development of new therapeutic agents. Many of these screening 
assays look for ligand induced changes in cell lines expressing the recombinant 
receptor. In some cases second messengers are assayed directly while in others, 
receptor is trans fected into a cell line carrying a reporter gene construct whose 

1 5 expression level can be influenced (positively or negatively) by functional activation of 
the receptor. One conunon result of the stimulation of many different second 
messenger systems is transient changes in intracellular calcium homeostasis, this can 
be the result of Ca-^+ release from various intracellular compartments or from the influx 
of extracellular calcium. 

20 Calcium transients offer a highly sensitive and selective method for 

characterization of AD4 gene function. Expression of recombinant AD4 in cell lines 
previously transfected with an aequonn reporter construct can be used to screen for and 
identify an AD4 ligand. Aequonn is a 21 kDa photoprotein that upon Cat-^ binding 
undergoes an irreversible reaction with the production of light in the visible range. 

25 Because the fractional rate of acquorin consumption is proportional in the physiological 
fCa++], it has been used for many years as a sensitive indicator of intracellular calcium. 
More recently, several different aequonn cDNA*s have been engineered which allow 
selective targetting of aequonn expression to different intracellular compartments, 
including the cytoplasm, the nucleus and the endoplasmic reticulum. This allows for a 

30 variety of second messenger coupled pathways/companments to be screened. 

Threefore, within one embodiment a cell line which expresses mutant 
AD4 may be utilized in order to identify compounds which modify the mutant protein 
function in a way that mimics wild-type AD4 activity. 

35 Labels 

AD4 proteins, nucleic acid molecules which encodes such proteins, anti- 
AD4 protein antibodies and agonists or antagonists, as described above and below, may 
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be labeled with a variety of molecules, including for example, fluorescent molecules, 
toxins, and radionuclides. Representative examples of fluorescent molecules include 
fluorescein, Phycobili . prolems, such as phycoerythrin, hodamine, Texas red and 
luciferase. Representative examples of toxins include ricin, abrin diphtheria toxin, 
5 cholera loxin, gelonin, pokeweed antiviral protein, tritin, Shigella toxin, and 
Pseudomonas exotoxin A. Representative examples of radionuclides include Cu-64, 
Ga.67, Ga-68, Zr-89, Ru-97, Tc-99m, Rh-105. Pd-109, In-1 1 K M23, 1-125. 1-I31, Re- 
186, Re-188, Au-198, Au-199, Pb-203, Al-211, Pb-2I2 and Bi-212, In addition, the 
antibodies described above may also be labeled oi conjugated to one partner of a ligaiid 
to binding pair. Representative examples include avidin-biotin, and ribonavin-riboflavin 
binding protein. 

Methods for conjugating or labeling the ADA proteins, nucleic acid 
molecules which encodes such proteins, anti-An4 protein antibodies and agonists or 
antagonists, as discussed above, with the representative labels set forth above may be 

15 readily accomplished by one of ordinary skill in the art (see Trichothecene Antibody 
Conjugate. U.S. Patent No. 4,744,981,; Antibody Conjugate, U.S. Patent No. 
5,106,951 : Fluorogenic Materials and Labeling Techniques, U.S. Patent No, 4,018,884; 
Metal Radionuclide Labeled Proteins for Diagnosis and Therapy, U.S. Patent No. 
4,897,255; and Metal Radionuclide Chelating Compounds for Improved Chelation 

20 Kinetics, U.S. Patent No. 4,988,496; see also Inman, Methods In Enzymolo^v, Vol. 34, 
Affinity Techniques, Enzyme Purification: Part Jakoby and Wilchek (eds.). 
Academic Press, New York, p. 30, 1974; see also Wilchek and Bayer, "The Avidin- 
Biotin Complex in Bioanalytical Applications," Anal. Biochem. / 7/: 1-32. 1988). 



25 Pharmaceutical Compositions 

As noted above, the present invention also provides a variety of 
pharmaceutical compositions, comprising one of the above-described AD4 proteins, 
nucleic acid molecules, vectors, antibodies, host cells, agonists or antagonists, along 
with a pharmaceuiically or physiologically acceptable carrier, excipicnts or diluents. 

30 Generally, such carriers should be nontoxic to recipients at the dosages and 
concentrations employed. Ordinarily, the preparation of such compositions entails 
combining the therapeutic agent with buffers, antioxidants such as ascorbic acid, low 
molecular weight (less than about 10 residues) polypeptides, proteins, amino acids, 
carbohydrates including glucose, sucrose or dextrins, chelating agents such as EDTA, 

35 glutathione and other stabilizers and excipienls. Neutral buffered saline or saline mixed 
with nonspecific serum albumin are exemplary appropriate diluents. 
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In addition, the pharmaceutical compositions of the present invention 
may be prepared for administration by a variety of different routes, although 
intracranial routes are typically preferred. In addition, pharmaceutical compositions of 
the present invention may be placed within containers, along with packaging material 
5 which provides instructions regarding the use of such pharmaceutical compositions. 
Generally, such instructions will include a tangible expression describing the reagent 
concentration, as well as within certain embodiments, relative amounts of excipient 
ingredients or diluents (e.g., water, saline or PBS) which may be necessary to 
rca)nslilule llie pharmaceutical composition, 

10 

Methods of Treating or Preventing Al2he[mer Disease 
The present invention also provides methods for treating or preventing 
Alzheimer Disease, comprising the step of administering to a patient a vector {e.g.^ 
expression vector, viral vector, or viral particle containing a vector) or nucleic acid 
15 molecules alone, as described above, thereby reducing the likelihood or delaying the 
onset of Alzlieimer's Disease. 

Similarly, therapeutic peptides, peptidomimeiics, or small molecules 
may be used to delay onset of Alzheimer's disease, lessen symptoms, or halt or delay 
progression of the disease. Such therapeutics may be tested in a transgenic animal 
20 model that expresses mutant protein, wild-type and mutant protein, or in an in vitro 
assay system. 

One such in vitro assay system measures the amount of amyloid protein 
produced- Briefly, by way of illustration, a cell expressing both AD4 gene product and 
amyloid is cultured in the presence of a candidate therapeutic molecule. The AD4 

25 protein expressed by the cell may be either wild-type or mutant protein. In either case, 
the amount of amyloid protein that is produced is measured from cells incubated with 
or without (control) the candidate therapeutic. Briefly, by way of example, cells arc 
labeled in medium containing ^^S-methionine and incubated in the presence (or 
absence) of candidate therapeutic. Amyloid protein is detected in the culture 

30 supematent by immunoprecipitation and SDS-PAGE electrophoresis or by ELISA. A 
statistically significant reduction of amyloid protein compared to the control signifies a 
therapeutic suitable for use in preventing or treating Alzheimer's disease. 

Alternatively, transgenic animals expressing Alzheimer's disease protein 
may be used to test candidate therapeutics. Amyloid protein is measured or, if the 

35 animals exhibit other disease symptoms, such as memory or learning deprivation, an 
increase in memory or learning is measured. Memory and learning are tested in rodents 
by the Morris water maze (Stewart and Morris in Behavioral Neuroscience, R. Saghal, 
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Ed. (IRLPrcss, 1993, p. 107) and the Y-maze (Brils et al.. Brain Res. Bull. 6:1\, 1981). 
Therapeutics are administered to animals prior to testing. The response time in trials 
are measured and an improvement in memory and learning is demonstrated by a 
statistically significant decrease in the timed trials. 
^ As noted above, the present invention provides methods for treating or 

preventing Alzheimer's disease through the administration to a patient of a 
therapeutically effective amount of an antagonist or pharmaceutical composition as 
described herein. Such patients may be identified through clinical diagnosis based on 
symptoms of dementia or learning and memory loss which are not attributable to other 

10 causes. In addition, patients are also identified through diagnosis of brain atrophy as 
determined by magnetic resonance imaging. 

Cognitive behavior in AD may be measured by any one of several tests 
{See Gcrshon et al., Clinical Evaluation of Psychotropic Drugs: Principles and 
Guidelines. Prien and Robinson (eds ). Raven Press, Ltd.. New York, 1994, p. 467). 

15 One such test, BCRS, is designed to measure only cognitive functions: concentration, 
recent memory, past memory, orientation, functioning, and self-tare. This test, as well 
as the Weschler Memory Scale and the Alzheimer's Disease- Associated Scale, may be 
used 10 determine improvement following therapeutic treatment. "Improvement" in 
Alzheimer's disease is present if there is a statistically significant difference in the 

20 direction of normality in the Weschler Memor>' Scale test. For example, test results of 
the performance of treated patients as are compared to members of the placebo group or 
between subsequent tests given to the same patient. Improvement within the present 
invention also encompasses a delay in the age of onset of Alzheimer's disease. 

As will be evident to one of skill in the art, the amount and frequency of 

25 administration will depend, of course, on such factors as the nature and severity of the 
indication being treated, the desired response, the condition of the patient, and so forth. 
Typically, the compositions may be administered by a variety of techniques, although 
intra-cronial routes are often preferred. 

Within other embodiments of the invention, the. vectors which contain or 
.30 express the nucleic acid molecules which encode the AD4 proteins described above, or 
even the nucleic acid molecules themselves may be administered by a variety of 
alternative techniques, including for example administration of asialoosomucoid 
(ASOR) conjugated with poly (L-lysine) DNA complexes (Cristano et al., PNAS 
92122-92126, 1993), DNA linked to killed adenovirus (Curiel et al.. Hum. Gene Ther. 

35 i(2):147-154, 1992), cyiofectin-mediated introduction (DMRiE-DOPE, Vical, Calif), 
direct DNA injection (Acsadi el al.. Nature ii2:815-818, 1991); DNA ligand (Wu et 
al., J. o/Biol. Chem. 26^:16985-16987, 1989); lipofection (Feigner et al„ Proc, Natl. 
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Acad Sci. USA ^^:7413-7417, 1989); liposomes (Pickering et aL, Circ. ^9(1):13-2U 
1994; and Wang et al., PNAS 84:1^5\-nS5. 1987); microprojcctile bombardment 
(Williams el aL, PNAS 88:2126'2730, 1991); and direct delivery of nucleic acids which 
encode the AD4 protein itself either alone (Vile and Hart, Cancer Res. 53: 3860-3864, 
5 1993), or utilizing PEG-nucIeic acid complexes. 

The following examples are offered by way of illustration, and not by 
way of limitation. 
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EXAMPLES 
EXAMPLE 1 

Genetic Mapping of the AD locus from the Volga German Kindred 

5 

The Volga German kindreds are a group of 7 related families with 
autosomal dominant early-onset Alzheimer's disease (AD). These families come from a 
group of Germans, who, in the 1760's, emmigrated to the Volga river region of Russia. 
The VG remained culturally dislincl from the surrounding Russian population and has 

10 apparently been genetically relatively isolated until this centur>'. In 1987, Bird and 
collegues (Bird, el al., Ann. Neurol. 2i;25-31, 1988) identified 5 VG kindreds with 
early-onset AD. Subsequently 2 additional early-onset VG families were identified 
(Bird, el al., Ann. Neurol. 25:12-25, 1989). in addition, other families with early-onset 
disease have been identified in which, similarly to the VG kindred, the APP gene and 

15 the chromosome 14 locus have been excluded as harboring the causative gene. I'hesc 
families include 4 Swedish families with a mean agc-of-onset ranging from 55 to 64 
and 2 Dutch families with mean onsets of 52 and 59 years. The 7 VG families are 
dcscendents of inhabitants of 3 adjacent villages from the Volga region of Russia. 
Although a common ancestor could not be traced, the unique ethnic origin coupled with 

20 150 years of relative genetic isolation suggests that AD in these families is likely the 
result of a common genetic founder. Inheritance of AD in these kindreds appears to be 
autosomal dominant. Onset of AD in these families ranges from 40 to 81 years with 
family mean age-of-onsets ranging from 51 to 65 years (Bird ct aL, supra, Table 1). 
Numerous affected subjects in these families have been extensively characterized, both 

25 clinically and neuropathologically, and at least I affected subject from each family has 
had autopsy confirmation of the diagnosis of AD. Except for the relatively early agc- 
of-onset, AD in the VG is clinically and pathologically indistinguishable from typical 
AD. Neurofibrillary tangles, AP neuritic plaques, and other changes associated with 
AD have been observed in multiple autopsies. 
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Table 1. VG kindreds used for linkage analysis. 



F* amity 


Affected 


Autopsicd 


Aflectcds 


Total 


Mean age of onset 




(n) 


(n) 


sampled 


subjects 


-t SD.(n). range 


H 


8 


3 


4(2) 


4 


59.5 + 3.9 (6) 56-68 


HB 


23 


5t 


6<2) 


30 


60.8 + 7.1 (22) 54-75 


HD 


19 


2 


6 


20 


59.6+ 10-3 (17) 46-82 


KS 


14 


3 


8(1) 


29 


64.8 + 5.4 (Hi 55-71 


R 


20 


5 


9(3) 


37 


50.2 + 7.3 (17) 40-67 


W 


4 


2 


2 


4 


59.2+ 10.5 (4) 48-79 


WFL 


6 


2 


2 


15 


63.8 + 7.6 (6) 55-76 


Totals 


94 


22 


37 (8) 


139 


59.1 + 9.1 (86) 40-82 



1 5 * Includes I unaffected subject autopsicd. 

^ Numbers in parentheses indicates DNA samples obtained from autopsy material, either as paraffin 
blocks or frozen brain. 

The locus responsible for AD in the VG kindreds has not been 

20 previously mapped genetically. Linkage analysis, as well as mutational analysis, of the 
VG families with chiomosome 21 and chromosome 14 markers (AD3 locus and APOE 
locus) has yielded negative results. Moreover, the locas responsible for AD in the VG 
kindreds does not cosegregate with markers at the APP locus, and no mutations have 
been detected in the APP gene. 

25 Linkage analysis was used to scan the genome for the locus causing AD 

in these VG families. DNA was prepared from lymphoblastoid cell lines from 139 
individuals including 37 affected subjects. When suggestive evidence for linkage was 
found, autopsy-derived tissue, either frozen or embedded in paraffm^ was used to 
prepare DNA from 8 additional affected subjects for whom no other tissue was 

30 available. Markers on all chromosomes were genotyped and analyzed using the lod 
score method (Morton, Am. J, Hum. Genet. 7:277, 1955; Hodge, et al., Am. 1 Hum. 
Genet. Ji: 11 39. 1983). For the genome screen, evidence for linkage was evaluated 
assuming autosomal dominant inheritance with age-dependent penetrance and a 0% 
sporadic rate. Lod scores were also computed using a low (1%) penetrance model, 

35 which makes no assumption about the disease status of at-risk individuals and thus 
serves as a check that information about linkage derives primarily from the afiected 
individuals (in whom the genotype at the disease locus is more accurately known 
compared to that of at-risk subjects). Published marker allele frequencies (Genome 
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Database) were used unless critical allele frequencies were significantly lower than 
those estimated in the VG. In this case, frequencies obtained from the VG pedigrees 
using all subjects (affecteds, at-risk and spouses) were used. This conservative 
approach was taken during screening since underestimation of the frequency of an 
5 allele co-segrcgaiing with the disease can cause false positive evidence for linkage (Ott, 
Am. J, Hunt Genet. 57:283. 1992). However, this approach will underestimate lod 
scores if the true allele frequency is lower than assumed. In this study, 162 publically 
available markers were genotyped. Among these were 70 STRP markers with 
heterozygosity values mostly > 0.70 spaced > 20 cM apart. 

^0 This work demonstrates the presence of a previously unmapped AD 

locus on chromosome lq31-42. 

Initial suggestive lod scores for chromosome 1 were obtained for 
markers DIS103 (Z„„ = 1.79, 9 = .10) and D1S249 (Z^,, = 1.76, 6 = .20) (Table 2), 
which are separated by approximately 26 cM (Fig. 1 ). Subsequently, 21 other markers 

15 in this region spanning 55 cM were analyzed (Table 2). Using the conscrx'ative 
screening analysis conditions, significant evidence for linkage (Z„„ > 3.0) was obtained 
with D1S479 (Z„„ = 4.40, G = .11). This lod score was similar to that predicted by 
earlier power analyses. Most of the evidence for linkage comes from families HB and 
R, with positive lod scores also observed for families W, H, and HD (Table 3). 

20 Subsequently, a 112 bp allele of D1S479 co-segregated with the disease in 5 of 7 
families, which is consistent with a common genetic founder. Other markers in the 
region also gave positive though non-significant lod scores (e.g. D1S439, Z^^, = 2.82, 0 
= -17; D1S320, Z^ = 1.87,6 = .14; Dl SI 03, Z„„ = 2.40, 6 = .008; Table 2). 

Table 2 demonstrates that FAD is linked lo chromosome I markers. 

25 Genotypes were perfonned by conventional methods, such as by PGR. All genotypes 
for DIS479 were determined in duplicate. For markers D1S227, DIS320, D1S213, 
D1S439, D1S479, D1S225, and DIS103. genotypes were determined for 8 samples 
from affected subjects either derived from either paraffin blocks or frozen brain 
material (not all samples amplified with all markers). Lod scores were computed using 

30 the computer program LfPED (Ott, Am. J. Hum. Genet.. 2d:588, 1974) assuming age- 
dependent penetrance (Morton, Am. J. Hum. Genet., 7:277, 1955). 
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Table 2, 

Lod scores for linkage of FAD to chromosome I markers*. 







Recombination fraction m 


Locus 


Het* 


0.001 


0.05 


0.10 




A OA 

O.iU 


0.30 


0.40 


D1S238 


0.86 


-9.29 


-1.99 


-0,34 






0.81 


0.43 


D1S422 


0.76 




-1.70 


-0.28 


V.Jo 


U.o7 


0.65 


0.27 


DIS412 


0.71 


-7.82 


-1.66 


0.18 


1 00 


l.J J 


1.17 


0.54 


D1S306 


0.61 


-14.71 


-7.75 


-4.69 


-2.88 


-1. / 1 


A ^ 1 

-0.43 


0.01 


DIS310 


0.57 


-6.03 


-1.75 


-0.79 




n 

-U.uO 


0.14 


0.11 


DIS249 


0.87 


-9.76 


-1.12 


0.65 


1 44 


l./o 


1.60 


0.81 


D1S245 


0.82 


-12.15 


-2.56 




U.JO 


1.05 


1.11 


0.55 


D1S205 


0.80 


-11.66 


-3.39 






U.04 


0.38 


0.20 


D1S425 


0.81 


-8.36 


-2.38 




-n 10 


0.05 


0.27 


0.15 


D1S217 


0.66 


-3.87 


-0.97 




u. lu 


0.30 


0.32 


0.1! 


D1S237 


0.76 


-4.77 


-1.88 


-0.76 


-0.13 


0 70 


U.Jo 


0.20 


D1S229 


0.77 


-6.54 


-1.42 


-0.34 


0.12 


0.28 


0.18 


-0.08 


D1S227 


0.67 


-2.23 


0.23 


0.80 


0.92 


0.83 


0.34 


-0.07 


D1S320 


0.70 


-8.99 


-123 


0.36 


1.03 


1.27 


1.09 


0.42 


D1S2I3 


0.86 


-3.80 


-L57 


-0.80 


-0.37 


-0.13 


0.05 


0.04 


D1S479 


0.80 


-0.05 


3.91 


4.39 


4.25 


3.81 


2.50 


0.99 


DIS439 


0.80 


-5.59 


1.05 


2.36 


2.78 


2.76 


1.97 


0.73 


DIS251 


0.82 


-15.43 


-5.19 


-2.67 


-1.31 


-0.51 


0.14 


0.14 


DIS225 


0.79 


-10.44 


-2.62 


-0.48 


0.60 


1.12 


1.16 


0.52 


D1S103 


0.80 


0.33 


1.60 


1.79 


1.74 


1.57 


1.05 


0.46 


D1S459 


0.79 


-20.40 


-5.86 


-2.87 


-1.39 


-0.59 


0.02 


0.07 


D1S446 


0.69 


-12,37 


-4,50 


-1.99 


-0.71 


-0.02 


0.46 


0.32 


DIS235 


0.68 


-7.54 


-1.09 


0.27 


0.85 


1.06 


0.88 


0.34 



Lod scores were computed using published allele frequencies except for D1S479 For 
^s marker, the al ele frequency for the 1 12 bp allele was derived frl the VG subject 
^HerZ::il^'^'^''' ^•'^•^ ^^^^ '^^^ values (1 7). ^ 
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In Table 3, the family lod score for Hkage of AD lo the marker D1S479 
is presented. Conditions 1 and 2 are age-dependent penetrance with VG and CEPH 
allele frequencies respectively; conditions 3 and 4 are age-dcpendcnt penetrance with a 
sporadic correction function using VG and CEPH allele frequencies, respectively. For 
5 the H, HB» and R families, maximum lod score were obtained with D1S479. For other 
families, maximum lod values were obtained with other markers (HD: D1S103, Z„.^ = 
1.75, e = 0,001, condition 4), D1S249 (KS; D1S4I2, Z,^ = 1.20, 8 = ,10, condition 2), 
and D1S43WDIS227 (W; D1S227 and D1S439, Z^, = 0.78, 9 = OOK condition 4), No 
marker was positive for the WFL family under any condition. Under the low (1%) 
1 0 penetrance model, Z„3, values were 3 .04 (9 - . 1 0) and 1 .27 (9 = . 1 5) for the control and 
VG allele frequencies, respectively. 
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Table 3 



Family lod score for linkage of AD to D1S479. 



Family 


Condition 


0.001 


0.05 


O.IO 


0.15 


0.20 


0.30 


0.40 


H 


1 


0.45 


0.36 


0.28 


0.21 


0.15 


0.06 


0.01 




2 


0.44 


0.36 


0.28 


0.21 


0.15 


0.06 


0.01 




3 


0.90 


0.78 


0.66 


0.54 


0.42 


0.21 


0.06 




4 


0.90 


0.78 


0.66 


0.54 


0.42 


0.21 


0.05 


HB 


1 


1.67 


1.78 


I.7I 


1.54 


1.33 


0.80 


0.26 




2 


1.60 


1.72 


1.64 


1.49 


1.27 


0.76 


0.25 




3 


2.38 


2.47 


2J7 


2.18 


1.93 


1,31 


0.59 




4 


2.31 


2.40 


2.30 


2.12 


1.87 


1.27 


0.57 


HD 


1 


-K84 


-0.45 


-0.22 


-0.12 


-0.08 


-0.05 


-0.02 




2 


-0.32 


-0.13 


-0.05 


-0.02 


-0.02 


-0.03 


-0.02 




3 


-UO 


0.07 


0.28 


0.34 


0.32 


0.19 


0.07 




4 


0^2 


0.40 


0.45 


0.44 


0.38 


0.21 


0.07 


KS 


1 


-1.57 


-0.28 


-0.06 


0.03 


0.05 


0.03 


0.00 




2 


-1.40 


•0.26 


-0.05 


0.03 


0.05 


0.03 


0.00 




3 


-1.57 


-0.28 


•0.06 


0.03 


0.05 


0.03 


0.00 




4 


-1.40 


-0.26 


-0.05 


0.03 


0.05 


0.03 


0.00 


R 


I 


1.07 


2J2 


2.49 


2.40 


2-19 


1.55 


0.70 




2 


1.34 


2.51 


2.58 


2.46 


2.22 


1.56 


0.71 




3 


1.46 


2.69 


2.84 


2.73 


2.48 


1.76 


0.83 




4 


1-73 


2.88 


2.94 


2.78 


2,52 


1.78 


0.83 


W 


I 


0.67 


0.59 


OJO 


0.42 


0.34 


0.18 


0.05 




2 


0.67 


0.59 


OJO 


0.42 


0.34 


0.E8 


0.05 




3 


0.67 


0.59 


r o3o 


6.42 


0.34 


0.18 


0.05 




4 


0.67 


0.59 


0.50 


0.42 


0J4 " 


0.18 


0.05 


WFL 


1 


-0.50 


-0.41 


-0J2 


-0.24 


-0,17 


-0.07 


-0.02 




2 


-0.50 


-0.4 J 


-0.32 


-0.24 


-0.17 


-0.07 


-0.02 




3 


-0.49 


-0.40 


-03 1 


-0-24 


-0.17 


-0.07 


-0.02 




4 


-0.49 


-0.40 


•0.31 


-0.24 


-0.17 


-0.07 


-0.02 


Totals 


1 


-0.05 


3.91 


4.39 


4.25 


3.81 


2.50 


0.99 




2 


2.05 


5.92 


6.29 


5.99 


3.37 


3.61 


1.57 




3 


t.83 


4J7 


4.60 


4.34 


3.85 


2.50 


-099- 




4 


3.94 


6.39 


6.49 


6.09 


5.41 


3.60 


1.56 
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In the VG pedigrees, genotypes are not available from many deceased 
individuals v^ho are related to sampled subjects (e g. all individuals in generations I-III 
in Fig. 2). Because of this missing data problem^ marker allele frequencies can 
influence linkage analysis results. For most of the markers used, allele frequencies in 
5 the VG families were similar to those from controls. However, D1S479, in 5 of the 
families (H, HB, HD, R, and W), was substantially more frequent in the affected 
subjects (0.32 in affected subjects, 0.18 in all VG subjects,) compared to a frequency of 
0.04 in controls. Such an increase in frequency for a closely linked marker allele is 
expected. When control frequencies based on VG unaffected spouses were used for the 

10 112 bp D1S479 allele, the maximum lod score for D1S479 increased from Z^^ = 4.40 
(8 - . 1 1 ) to Z^, = 6.29 (9 = . 1 0) (Table 3). 

Linkage analysis may be affected by the presence in the pedigrees of 
phenocopies (non-AD dementia cases or AD caused by something other than the major 
gene segregating in these families). Phenocopies can reduce the power to detect 

15 linkage and increase the estimated 9 (Cavalli-Sforza and King, Am. J. Hum. Genet, 
38:599, 1986). Phenocopies are a potential problem in the VG kindreds since the range 
of onset ages for AD in these families extends up to 82 years of age (Table 1), which 
overlaps with common late-onset AD. To correct for the presence of late-onset AD 
cases in the families, a phenocopy correction model was used (Schellcnberg, et al.. Am 

20 J. Hum. Genet, iJ:6l9, 1993), which assumes that as the age-of-onsei of a subject 
increases, the probability that the case is a phenocopy is increased. Despite the fact that 
in the absence of true phenocopies this model reduces the power to detect linkage, for 
all markers analyzed, the lod scores increased (Margaritte.et al., Am. 1 Hum, Genet. 
50:\22\, 1992). For D1S479, Z„„ values increased from 6.29 to 6.51 and from 4.40 to 

25 4.60 using the control and VG allele frequency for the 112 bp allele, respectively. 
Under this model, the maximum D1S479 lod score for a single family for D1S479 was 
2.95 (9 = .09) for the R family. These results support the hypothesis that at least some 
of the AD cases in these families are probably phenocopies. 

Haplotypes were constructed for the R family using the 23 markers 

30 spanning approximately 55 cM of chromosome 1 (Fig. 7). A common haplotype 
between D1S439/D1S479 and D1S306 was observed in all but 1 of the affected 
subjects (IV- 1, Fig. 7), This individual did not share the disease haplotype across the 
entire region, had an age-of-onset of 67 years, which is greater than 2 SD above the 
family mean, and had an e4/e4 genotype at the APOE locus. This APOE genotype is 

35 associated widi lower age-of-onset for late-onset AD (Corder, et ai.. Science 267:921, 
1993). Thus, this subject may be a phenocopy. In other families, definite assignment 
of a common disease haplotype was difficult because of missing data, and will require 
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extensive multipoint analysis. However, for D1S479, the 112 bp allele segregated with 
the disease in the H, HB, HD, W, and R pedigrees, in the HD family, the 1 affected 
subject who did not have the 1 12 bp allele, had an age-of-onset of 75 years and was e3/ 
e4 at the APOE locus. However, age-of-onset alone could not be used to 
5 unambiguously determine whether a subject is a phenocopy. In the HB family, subjects 
with onsets of up to 75 years shared the DIS479 112 allele with other younger affected 
subjects. The KS family is the only VG kindred in which no affected subject had the 
D1S479 112 bp allele. Analysis of this family is complicated because of the late age- 
of-onset (64.8 years) and because of the 8 affected subjects, 3 were AFOE i:4/t;4 

10 homozygoies (onset ages of 57, 58, and 67 years) and 4 were e4 heterozygotes (onset 
ages 67, 68. 68 and 71 years). Thus, the KS family could be an example of genetic 
heterogeneity within the VG kindreds, although group heterogeneity tests did not delect 
heterogeneity. In summary, the finding of the same rare allele segregating with AD in 
most of the VG pedigrees supports the hypothesis of a common genetic founder for 

1 5 most of the families. As described herein, the sequence data from VG patients confirms 
this hypothesis. 

Taken together, these data demonstrate the presence of an AD locus at 
lq3l-42. Analysis of recombinants in the R family define the candidate region as 
between D1S225 (subjects VI-3 and V-3 in Fig. 2) and DIS2I7 (subject IV- 16), a 
20 region spaiuiing 14 cM (Fig. 8), 

EXAMPLE 2 

Cloning of the AD4 gene from chromosome I using YACS 

25 YAC clones derived from chromosome 1 were obtained from the CEPH-Genclhon 
library and were identified for the region spanning D1S229 to D I SI 03 by 2 different 
methods. 

In the first method, the CEPH-Genethon YAC library was arrayed for 
PCR-based screening. This publically available library (Bellanne-Chantelol, et al.. Cell 

30 70, 1059-1068, 1992) was obtained from Glen Evans (Salk Institute) as 21,792 clones 
in 227 96-well plates. For DNA preparation, colonies were grown separately in 1.5 ml 
liquid media, harvested, and pooled, Spheroplasts were made by digestion with 
lyticase. DNA was purified by phenol/chloroform/isoamyl alcohol extractions and 
precipitated by the addition of sodium acetate and ethanol. After resuspension and 

35 RNase treatment, the DNA was ready for PGR amplification. 

A two-dimensional YAC pooling scheme allowed the identification of 
specific YAC addresses by PCR screening (Amemiya, et al., Nucl. Acids. Res. 
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20:2559-2563, 1992). The library was subdivided into five sets (4 x 48 and 1 x 35 
trays). Duplicates v^ere made of each set. One replica of each of the 5 sets was used 
for the first dimension basic horizontal pools (each pool represents all clones from a 
single 96-welI plate). The second replica was used for second dimension basic vertical 
5 pools (pools containing clones from a specific well position such as Al for plates from 
a set). The pools within each set were further combined to facilitate J'CR screening. 
For each set, horizontal pools were combined to generated to 4 super-horizontal pools. 
Each super-horizontal pool contained clones from 12 consecutive plates (plates 1-12, 
13-24, 25-36, 37-48). The second dimension was combined to produce 8 super-vertical 

10 pools per set. Each super vertical pool contained clones from each row in a set (A, B, 
C,...etc,). Hence the entire library was reduced to 19 super horizontal pools (4X4 sets 
+ 3X1 set), and 40 super vertical pools (8X5 sets). In this way, candidate clones can 
be found after two rounds of PCR i.e. a first round with super pools and a second round 
with the basic pools. When multiple hits occur, a final round of PCR is necessarv'. 

^5 The YAC library was screened using primers for the STS D1S320 and 

D1S479 as shown in Figure 9. This screening yielded the following YACs: 920e7, 
859b6,904O. 

In the second method, a data base was queried. 71ie CEPH-Genethon 
INFO clone data base contains information concerning the genetic marker and STS 

20 content of YACs in the CEPH-Genethone YAC library. This database was queried 
with the following markers: DIS229, DIS227, D1S213, D1S479, D1S439, DIS225, 
D1S103, DIS495. The YACs identified were 857h3. 934r2, 957gl0, 881b8, 9l3cl0, 
8IOe5, 921dl2, 865c8, 920e7, 859b6, 736b6, 746d7, 748h9, 748e2, 753a5, 753d8, 
753d9, 753f9, 753fl2, 753h3, 757d8, 759bl0, 757(9, 76lhl0, 767bl, 767dK 767gl, 

25 769ni, 786a8, 786fll, 820g5, 824g8, 824g9, 824gn, 826b3, 826b5, 83le4. 849a7, 
856a7, 880dl2, 897fi), 899fL 899hl, 920gl, 9040, 915e8. 922al0, 933a8, 958el0, 
978gl0, and 9790. Some of these YACs contain the marker used in the query and 
some are connected to YACs which contain the marker used in the database query. A 
total of 60 YACs were identified by the combined methods. 

30 YAC DNA was prepared from each YAC clone. The YAC DNA was 

then amplified using the primers shovwi in Figure 9. A YAC contig map partially 
covering the region from D1S229 to D1S103 was constructed. 

EST T03796 (Figure 10), was identified in the EST GenBank database 
as a sequence with homology to the chromosome 14 AD gene clones both at the 

35 nucleotide level and the amino acid level. Oligonucleotide primers, EPIP and EPIR, 
and ICM7749 and KM7750 were used for PCR amplification (Figure 1 1). Given the 
EST sequence, the expected size was approximately 140 bp (for primers EPIF and 
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EPIR) or 190 bp (for primers KM7749 and KMllSO). When human genomic DNA 
was amplified, a 1,300 bp band was observed. Amplification of the YACs from the 
chromosome 1 Volga German gene region with the same primers yielded a 2.5 kb 
fragment from four different isolates of YAC 921dl2. One isolate of 92Idl2 did not 
yield this fragment, but it was smaller and presumably deleted for the 2.5 kb fragment. 
YAC 92ldl2 has been previously shown to contain the markers D1S439 and D1S479. 

In addition, a panel of rodent-human hybrid cell lines obtained from the 
Coriell Institute for Medical Research were also amplified with the same primers. 
These cell lines contain one or a few human chromosomes on a rodent background 
(either mouse or hamster). Only cells containing human chromosome 1 had the 2.5 kb 
band. 

DNA from YAC 921dI2 was amplified with primers EPIF and EPIR 
and subsequently with WWF and WWR (Figure 11) to produce a 2.5 kb fragmcnt. 
DNA sequence analysis of this 2.5 kb fragment was performed with primers WWF and 
WWFIL (Figure 2) to give sequence ELLl.l; (Figure 10) this sequence contained at its 
5' end a portion of the EST T03796. This 2.5 kb fragment was also sequenced with 
primers WWR and WWRl R yielding the sequence ELLl .2, which also contained a part 
of the DNA sequence from T03796. The 2.5 kb band from YAC 921dl2 was 
reamplified, purified and sequenced using the primers: WWF, WWFIL, WWR and 
WWRIR. The sequence of these DNA molecules is shown in Figure 10. In this 
fragment, 119 bp on the 5' and and 69 bp on the 3' end were identical to the EST 
sequence. In addition, a splice acceptor and donor sequence were observed at both 
junctions of the EST sequence, suggesting that the intervening sequence is an intron of 
the chromosome 1 gene. Thus, EST T03796 is on YAC 921D12, and is a candidate for 
containing at least part of the Volga German AD gene. 

cDNAs encoding the AD gene were isolated as follows. One million 
clones from a Werner syndrome fibroblast cDNA library or B616 human brain cDNA 
library were plated and transferred to Duralon membranes (Stratagene). The filters were 
prc-hybridized for one half hour in Church-Gilbert Hybridization solution (0.5M Na 
phosphate, pH7.0. 7% (w/v)SDS, 1% (w/v)BSA) at 65'' C and screened using 500,000 
cpm/ml of a ] 40 base pair PGR product amplified from the EPl primers (140 bp of the 
EST T03796) or a 190 bp PGR product amplified from KM7749 and KM7750 primers. 
Probe was labeled using a random primed method (Doehringer-Maimheim) for half an 
hour and unincorporated nucleotides were removed by a Microspin s-200 HR column 
(Pharmacia). The filters were hybridized for 16 hours and then washed 1 5 minutes each 
at 65^ C in 2X SSC/0.l%SDS followed by a wash in O.IX SSC/0.1% SDS. 
Autoradiographs were exposed for 16 hours and four primary candidates from the 
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fibroblast library and five primaries from the brain library' were isolated. The primary 
candidates were positive upon rescreening. Secondary positives were picked into SM 
buffer and rescued using published protocols (Stralagene). 

In addition, 600,000 clones in an arrayed version (Munroc^et al., Froc. 
5 Nati Acad. Sci USA P2:2209-2213, 1995) of the above fibroblast cDNA library were 
screened using PGR methods. The EPl and WW! primer pairs were used to screen the 
plate pools (1-48). The EPl primer pair gave positives in plates i K 1 7 and 36. while the 
WWl primers gave positives in plates 17 and 36. When PGR was done to determine 
column and row addresses only two addresses could be determined: plate i 7 G7 (both. 

10 primer pairs) and 17 CI! (WWl only). Confirmalor>' PGR on the individual wells 
showed product for only 17C1 1. Both wells were plated for lcrtiar>' screening to obtain 
plaque purified phage. The EPl PGR product was used for screening as above; positive 
plaques were isolated for 17 Cll and were rescued as above. Positive plaques 
eventually purified were designated 1 1. ISA, 15B, and Gl U PS2-6, PS2-I0, and PS2- 

15 41. 

The sequence fi*om chromosome 1 was then used to screen several 
cDNA libraries. Positive clones 2.2 kb in length were isolated and sequenced. The 
sequence, shown in Figure 1, shows clear overiap with the ends of the PGR fragment. 
Additional EST sequences with overlapping sequence, and additional cDNA sequences 
20 were obtained by further library screening. 

EXAMPLE 3 
Mutations in Affected VG patients 

Using RT-PGR, the entire gene has been amplified from both affected 
VG individuals and unaffected individuals. The unaffected individuals are from VG 
families and from unrelated lineages. The sequences obtained have been analyzed. In 
affected individuals, there is a nucleotide change in codon 141 (AAG-^ATG) resulting 
in an amino acid alteration from asparagine to isoleucine (N141I), No such change was 
found in any unaffected individuals of comparable age. The amino acid sequence in the 
region of the alteration is LLNSVLNTLIMISVIVVMTI in the unaffected and 
LLNSVUTLIMISVIVVMTI in the affected individuals examined. Affected subjects 
carrying the N141I mutation were identified in 5 (H, HB, HD, R, W) of the 7 VG 
families initially screened. All subjects with the Ml 411 mutation also carried the 
D1S479 1 12 bp allele (allele H, Fig. 7). Subsequently, 2 additional VG families (E and 
BE) were screened for the NI41I mutation. In both kindreds, the affected subjects 
carried the N141I mutation and the D1S479 112 bp allele. In total, the N1411 mutation 



30 
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was directly observed in 20 affected subjects and inferred to be present from 
spouse/children genotypes in an additional 6 AD subjects; onset ages for the 26 subjects 
ranged from 44 to 75 years. The N14il nnutation was also observed in 15 at-risk 
subjects (blood relatives of affected N1411 carriers) with ages ranging from 23 to 67 
5 years. No VG spouse (n = 17) had the N141( mutation. Additional populations 
screened included 84 normal Caucasians, spouses, at-risk subjects and affected subjects - 
from 67 late-onset familial FAD families (n = 223 affected subjects), and 48 affected 
sporadic subjects from an AD clinic population. 

In addition to the N 14 II mutation, three other variants were identified at 
10 nucleotide 436 (C->T, subject HB), 496 (C->T, subjects HD and W) and 628 (C->T, 
subjects HD and W). These variants did not change the amino acid encoded at these 
sites (alanine, asparagine, and histidine, respectively). 

EXAMPLE 4 

15 Characterization of the AD4 Gene and its Protein Product 

The sequence of the VG from chromosome 1 was constructed from 
sequence of several clones as described. The sequence is shown in Figure 1. The 
amino acid sequence deduced from this DNA sequence is shown in Figure 2. 

20 Comparison of this sequence with the sequence of the AD3 gene from chrornosome 1 4 
shows sequence homology. An alignment of the proteins and genes as constructed 
using the program GAP from the Wisconsin package of sequence analysis software 
(Vcr, 8.0,1 - UNIX, September 1994) is shown in Figure 3. The amino acid sequence 
of AD4 exhibits approximately 67% identity to the AD3 gene product. Some notable 

25 characteristics of the AD4 gene product are shown in Figure 5. 

The protein encoded by the chromosome 1 AD gene is a membrane 
protein, as judged by the seven hydophobic segments presumed to be membrane 
spanning regions (Figure 4). The AD4 protein is also referred to as STM2. For STM2, 
all the putative transmembrane helices are capped at the car boxy 1-terminal end by a 

30 lysine residue, found either at the very end or within a few residues of the end of the 
transmembrane domains. The N141I mutation, a change from a hydrophilic asparagine 
residue to a hydrophobic isoleucine residue, is at a position directly adjacent to the first 
predicted transmembrane domain. 

A northern blot (Clontech) containing 2 pg of human poly A-plus 

35 MRNA derived from heart brain placenta, lung, liver, skeletal muscle, and pancreas 
was hybridized with the 2,3 kb AD4 cDNA clone as a probe. Hybridization was 
performed in 0.75M NaCl. 0.05M NaH2P04, 5mM NajEDTA, iOX Denhardt's 
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solution, 100 microgranVml salmon sperm DNA, 50% formamide, 2% SDS at 42°C for 
18 hours. Filters were washed with O.IXSSC, 0.1% SDS at 65°C. Northern blot 
analysis of RNA from a variety of human tissues revealed two AD4-related transcripts 
which were approximately 2.4 and 2.8 kb in size. The larger transcript appears to be 
5 primarily expressed in placenta, skeletal muscle and heart, while the smaller transcript 
is detected in heart, brain, placenta, liver, skeletal muscle and kidney but not lung. A 
sequence-tagged site (STS) from the 3* untranslated region was used to confirm that 
AD4 maps to chromosome 1 and to YAC 92Idl2. The (wo transcripts observed arc 
possibly the result of alternative polyadenylation since two putative polyadenylaiion 

10 signals were identified in the 3'untranslaled region, at nucleotides 1834 and 2309. The 
message sizes for STM2 from Northern blot analysis (2.4 and 2.8kb) are slightly larger 
than the 2.3 kh size predicted from the ftill-length DNA sequence and the 1,8 kb size 
predicted by the second polyadenylation site observed in the .sequence. This apparent 
discrepancy may be due to the anomalous migration of size markers which has been 

1 5 previously observed with this type of Northern blot. 

The normal cellular function of STM2 is unknown. If this protein is 
functionally similar to spe4, the C elegans gene with sequence similarity to STM2, it 
may play a role in the cytoplasmic partitioning of proteins. Mutations in STM2 could 
alter intracellular protein trafficking of APP and ultimately lead to altered APP 

20 processing and increased production of Apl-40 or Apl-42 (amyloid p protein). 
Consistent with this notion, recent work with fibroblast cell lines from AD3 carriers has 
shown that secretion of AP ending at position 42 is increased relative to non-carrier cell 
lines. Alternatively, STM2 may function as a G-protein coupled receptor or as an ion 
channel. 



25 



EXAMPLE 5 
Cloning o f AD gene on chromosome 1 



The gene may be obtained from genomic or cDNA libraries or from 
30 genomic DNA or mRNA. The nucleotide sequence of the gene provided herein may be 
used to design oligonuclotides. The primers are optimally 18-25 nucleotides in length 
and preferably contain not more than 50% AT base pairs, do not end in A. and should 
not have self-complementary regions. 

The oligonucleotides may be used as hybridization probes on cDNA or 
35 genomic libraries. For cDNA libraries, mRNA from a specific source, such as a cell 
line or tissue sample, is transcribed into cDNA and inserted into an appropriate vector 
(e.^,, Xgi\0, XZAP, pBS, or pSPORT). (see Sambrook et al.. Molecular Cloning, Cold 
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Spring Harbor, 1989). The recombinant vector is introduced into an appropriate 
bacterial strain {e.g,, DH5a or KW257) and the cells plated on growth medium. 
Alternatively, cDNA libraries representing RNA from various sources are obtained 
commercially (e.g.. Clontech Laboratories). In either case, colonies (for plasmids) or 
5 plaques (for phage) arc transferred to a nitrocellulose or nylon membrane and prepared 
for hybridization (Sambrook, supra). Oligonucleotide probes are labeled at their 5' end 
with using T4 polynucleotide kinase. Hybridization and washes are performed 
under standard conditions for the length and GC content of the probe (see Sambrook et 
al., supra). 

10 For isolation of genomic clones containing the AD4 gene, genomic 

libraries are constructed by standard methods (Sambrook el al., supra) or obtained from 
commercial sources (Clontech, Stratagene, Research Genetics and Genome Systems). 
Libraries are constructed in phage, cosmids, PL BACs» or YACs and probed as 
described for cDNA libraries. Colonies or phages that hybridize are picked, purified 

1 5 and grown for preparation of DNA. 

If a full-length sequence is not obtained by the cloning and screening 
methods described, full-length sequences may be obtained by screening additional 
clones, by RACE {Frohman and Martin, Proc. Nail. Acad ScL USA ^5:8998-9002, 
1988), ligation-anchored PGR (Troutt et ah, Froc. Natl. Acad Sci. USA 59:9823-9825, 

20 1992) or other similar methods. Verification that a clone contains AD4 sequence may 
be obtained by determining the DNA sequence of the clone and comparing it to the 
sequence of AD4 provided herein. Sequence identity of 90% or better indicates that the 
clone encodes AD4. Some mismatch may be due to polymorphisms or DNA sequence 
errors. 

25 

EXAMPLE 6 
Detection of mutations 

Mutations in the AD4 gene may be determined by nucleotide sequence 
30 analysis. Peripheral blood cells are obtained from patients by venipuncture and 
hypotonic lysis of erythrocytes. DNA or mRNA is isolated from these cells and the 
AD4 gene isolated by amplification. Oligonucleotide primers are chosen based on the 
cDNA sequence provided herein or genomic sequence. A computer program such as 
Primer 2.2, which is publically available from the web-server at the Whitehead^IT 
35 Center for Genome Research, assists in choosing oligonucleotide primers. If mRNA is 
used as a source of nucleic acids> first strand cDNA if synthesized by standard methods. 
Amplification of DNA is performed and the DNA sequence of the PGR product is 
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determined. A comparison of the sequence obtained from the patient and from a 
normal control allows identification of mutations. 

In addition, the N141I mutation in AD4 gene creates a Satt3A I 
restriction site, which may be used to rapidly screen for mutant alleles. Genomic DNA 
5 or cDNA may be directly screened for the Sau3A I restriction site or amplified by PCR 
using primers which flank the 141 codon. DNA or amplified DNA is resuicted with 
Sau3A I and electrophoresed in an appropriate percentage agarose gel. The presence of 
the Saii3A I site is detected by ethidium bromide fiuorescence for amplified products or 
by probe hybridization following transfer of the gel to nylon membrane. Suitable 

10 probes are oligonucleotides or DNA fragments containing sequence near or 
encompassing the 141 codon. Preferably, in the case of an oligonucleotide probe, there 
is at least 24 bases that do not span codon 141 . 

As shown in Figure 12, the NMIl mutation is delectable after 
amplification. Approximately 60-100 ng of genomic DNA was amplified with primers 

15 WWF and INTIR (intronic primer 5-GTGGGGCAGACGGAGAGAA-3') at 40 ng 
each, L5mM MgCKand 200 iiMdNTPs. Amplification was performed with an initial 
incubation at 95"* C for 3 min. followed by 35 cycles of 45 sec. at 94° C, 45 sec. at 
60^ C. and 45 sec. at 72° with a final incubation for 10 min. at 72^ The 160 bp PCR 
product (in 20 ul) was digested with 10 U San2A and the fragments resolved by 

20 electrophoresis in a non-denaturing 12% polyacrylamidc gel. Fragments were 
visualized by ethidium bromide staining. The N 1411 mutation results in fragments of 
102 bp, 58 bp, 43 bp and 15 bp (not shown) while non-carriers have only the 102 and, 
58 bp fragments. Lanes arc: 1, at-risk, 79 years; 2, affected, onset 60 years; 3, at-risk, 
67 years of age; 4, affected, onset 75 years (presumed phenocopy); 5, Boehringer 

25 Mannheim DNA size standard V; 6, affected, onset at 52 years; 7, affected, onset 46 
years; 8. affected, onset 49 years. 

EXAMPLE 7 

Determination of the DNA Sequence of the AD4 eenomic region 

30 

A 2.5 kb fragment of DNA from the AD4 gene was amplified from YAC 
921 dl2 DNA using primers WWF and WWR and radioactively labeled by nick- 
translation (Sambrook, et al. supra), A human genomic clone hybridizing lo this 
fragment was identified in a commercially available library (Genome Systems PI 
35 library). The hybridization was carried out according to the manufacturers instructions. 
PCR amplification using primers specific for the 5* and 3' ends of the AD4 cDNA 



BNSDOCID; <WO 9703 1 92A3_(A> 



wo 97/03192 PCT/US96/1 13«6 

51 

confirmed that the PI clone (clone #6004 in the nomenclature of Genome Systems) 
carried the entire AD4 gene. 

DNA from PI clone 6004 was sheared and subcloned into M13mpI8 
vector prepared by digestion with Smal and dephosphorylalion. The vector was 
5 purchased from Novagen (Catalog # 69996-1 ) and the fragments were cloned according 
to the manufacturers instructions. Approximately 1600 clones were isolated. The 
sequence of the insert was determined by fluorescent detection on an AB1373A 
according to the manufacturers instructions. The data was assembled using the 
program Xgap (Sladen, R. (1994) The Staden package. In "Methods in niolecuiai- 
10 biology" A.M. Griffin and H.G. Griffm, Eds., Vol 25, pp. 9-170, Humana Press). 
Exons were identified by entering the cDNA sequence into the assembly- 

The sequences of the 12 exons and the flanking iniron sequence of the 
AD4 gene are shown in Figures 13 to 19. The relative positions of the exons in these 
sequences are listed in Table 4. 



15 



Table 4 

Position of exons in the genomic sequence 





Exon 


Genomic 


Exon 


Exon 


20 




seauence 


position 


leneth 




exoni 


stm.genA 


546 


143 




exon2 


stm.genB 


850 


186 




exon3 


stm.genC 


327 


16i 




exon4 


stm.genC 


2140 


215 


25 


cxon5 


stm.genD 


168 


142 




exon6 


stm.genE 


808 


68 




exon? 


slm.genE 


1555 


221 




exon8 


stm.genF 


102 


99 




exon9 


slm.genF 


1345 


84 


30 


exon 1 0 


Stm.genF 


1809 


102 




exonl 1 


stm.genG 


1053 


117 




exon I 2 


stm.genG 


2470 


683 



BMSOOCID: <WO 9703192A3_IAs- 




PCT/US96/n386 



EXAMPLE 8 

Isolation of the gene fr om patient DNA and detection of polymorphisms. 

Treatment of Alzheimer's disease is more effective if begun prior to the 
5 onset of neural degeneration. However, current diagnostics only effectively identify 
patients after substantial loss of cognitive function. Basically, loss of cognitive 
function in an older patient without any detectable cause is the primary indicator of 
Alzheimer's disease. Strict confinmation of the diagnosis is made by autopsy. Any 
treatment which blocks the initiation or progression of Alzheimer's disease will be 

1 0 much more effective if an earlier diagnosis is made. 

One dominant mutation of the AD4 gene is known to cause early-onset 
Alzheimer\s disease. Other mutations in this gene or in other members of the same 
gene family may cause or contribute to Alzheimer's disease. We provide herein 
methods for detection of mutations in the AD4 gene. The same techniques will allow 

1 5 detection of mutations in other members of the gene family. 

Ten pairs of oligonucleotide primers used to PGR amplify ten fragments 
carrying the 12 AD4 exons are listed in Table 5. DNA is isolated from patient blood 
samples and individual PGR reactions carried out with each set of primers using the 
Perkin-Elmcr PGR kit according to the manufacturers instructions in a Perkin-Elmer 

20 9600 PGR machine with the following cycle conditions: 2 minutes at 92°G. 35 cycles 
of 20 seconds at 92°C, 45 seconds at the indicated annealing temperature, and 3 
minutes at 72°G, followed by 7 minutes at 72^C. PGR fragments are purified by gel 
filtration in a 96-well formal spin column with Sephracyl-500HR resin (Wang, Gan 
Boysen and Hood, Anal. Biochem. 226: 85-90, 1995). Mutations are detected by 

25 fluorescent dye terminator sequencing on the ABI373A using the sequencing primers 
specific for each exon as listed in Table 6. Many other mutation detection formats are 
possible using the PGR amplified fragments. 

A diagnostic test will have several uses in addition to the eventual use of 
targeting patients for early intervention with Alzheimer's drugs. First, clinical trials of 

30 any Alzheimer's drug would benefit from monitoring the allelic variations in genes 
which predispose towards the disease. The genetic background of the patients may be 
correlated with different mechanisms for initiation of the disease: a drug which is 
effective against one disease mechanism may not be effective against disease caused by 
other mechainsms. Second, the diagnostics may be used to choose the appropriate 

35 treatments if clinical trials show that different genetic backgrounds are correlated with 
the efficacy of drugs. 
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Table 5 

Primers for PCR of exon fragments 



Exons 






PCR product 


PCR conditions 


AnncaJing 


ampUficd^ 


Primer 


Sequence (5'-3*) 




Mga2 {m^f} 


temperature ("C) 


1 


INT105L 


GGCGTTTTGncnCTTCTCTC 


956 


I.O 


60 




{NT106R 


ACCAAAGGGTACAGAGTTTTCCC 








2 


IKTI08L 


CATCCCCAAGTGGTATGTGnA 


610 


1.0 


60 




IKTI07R 


aCCCCAGCCGTGCnGTG 








3 


IMTIllL 


CGTTrGATTGACAGGCATCTCT 


549 


1.0 


61 




INTIIZR 


CMGGTGGCAGAGTGGACAG 








4 


IKTII5L 


CATATGCCCTAGTA6CTCATAG 


596 


0.75 


56 




lhmi6R 


CCAHATACGAACAAGGAAGCTG 








5 




GCTCATGGGGATG6TGCCC 


1149 


1-0 


60 




INTIMR 


6AAGCCGCTGTCTAAAAGCTTAT 








6.7 


[NTIQ2L 


CaiCCCAGCGTAGGCATGAA 


1200 


2.0 


62 




INTIOIR 


GCAGAAAGGGATGCAAGCCC 








8 




GGCATGaCTGAGAGCTCCA 


572 


1.0 


60 




INTI09R 


GCCCAGAAGCTCCTT6CTCC 








9, 10 


INTIIOL 


TGGACCCCTCCCACAACGG 


868 


1.0 


60 




INTI04R 


TCACAGAGATGCCTCT6ATGG 








It 


INTI17L 


ACTffiTTrTACCTGAGmrCAGA 


568 


1.0 


58 




INTU8R 


CGAGCACACCTCCT6T6AG 










I>rr7L 


GGTGTCTAGCACCGnATCC 


803 


2.0 


60 




DMO1009 


ACAGGACTCATCTATTTATG6ATA 
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Table 6 

Primers for sequencing exons from PCR templates 



5 



Exon 


Primer seouence 


Primer Position 


Exon I 




808 


Exon 2 


CAATTGTCTGGGTATCA 


800 


Exon 3 


CTTTGTCTCACAGGAAA 


266 


Exon 4 


CGTGCATTACATGGATA 


2062 


Exon 5 


GTCCAGAATCACTCAAG 


79 


Exon 6 


ATCAGTCTCAGGATCCT 


719 


Exon 7 


GCTGACTGTCTGGAACT 


1584 


Exon 8 


CCAGTCAACTCTGAAAG 


288 


Exon 9 


CTTGCTCAATCTTCAAG 


1582 


Exon 10 


CCTGGTAACACTCTGAC 


1708 


Exon i i 


TCCACATCTTAGCTTCTAG 


912 


Exon 12, 5* 


CTAGACCATGACTCACAG 


2397 


Exon 12,3' 


AGAGGGGTCAACATTTAC 


3179 



20 

EXAMPLE 9 
Expression of STM7 

expression pattern of STM2 was examined in multiple tissues by 
Northern blot analysis (Fig. 20). A probe was generated by amplifying the STM2 
cDNA clone using primers ADC22R2 and ADC22L5, Identical patterns were obtained 
with probes made using primer pairs DM01 01 8 and DM0 1021 or DM0 1 009 and 
DMO1014. A 2.4 kb message corresponding to STM2 was observed in all tissues 

30 examined, including all regions of brain sampled. (In kidney, liver and lung, longer 
exposures of the same Northern blot showed STM2 to be present at low levels in these 
tissues; data not shown). High expression levels were observed in the pancreas, heart, 
and skeletal muscle. In these three tissues, a second 2.8 kb transcript was observed. 
From the STM2 cDNA sequence, two potential polyadenylation signals were identified 

35 at nucleotides 1834 and 2309 (total clone length, 2167 bp). To determine whether the 
2.4 and 2.8 kb transcripts could represent different length 3* UTR polyadenylation 
variants, RNA- blots were hybridized with a probe scanning the entire 3* UTR and with 
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a probe slatting after the first polyadenylation site. Both probes gave identical patterns, 
indicating that only the second polyadenylation site was used. 

In an attempt to identify homologues of STM2, liver and pancreas 
cDNA was amplified with degenerate primers based on the STM2 sequence. Twenty- 
5 two MI3 recombinant clones were identified and sequenced, all corresponding to 
STM2. Seven of the 22 clones had the sequence GAGATGGAAGAC beginning at 
nucleotide 1327 in the cDNA sequence, which is 3 bp shorter than the previously 
reported sequence {AGATGG|AAGAAGAC, where 'T' indicates the junction between 
exons 9 and 10). The shorter transcript encodes a predicted protein one amino acid 

10 shorter (EMED) versus EMEED) with no frameshift. The shorter transcript is likely a 
result of alternative splicing of exon 10, with the first coding AG dinucleotide used as 
the splice acceptor sequence or a polymorphism. To distinguish between these 
possibilities, RNA and DNA were prepared from a lymphoblastoid cell line from a 
normal (non-AD) VG subject in the HD family. Both transcripts were present in RT- 

15 PGR products using primers EX103L and EX102R from exons 9 and 10. However, 
when the intron 9 to exon 10 region of genomic DNA from the same subject was 
sequenced, a single sequence that corresponds to the genomic sequence was obtained. 
Thus the two different transcripts arise from alternative splicing. Both transcripts were 
also observed in leukocytes, skeletal muscle, and one fetal brain library' (Clontech), 

20 with the longer transcript being more abundant. However, in another fetal brain librar> 
(Stratagene), only the longer transcript was obser\'ed, while in a foreskin fibroblast 
library, only the shorter transcript was found. 

Table 7 

25 Primers for RNA analysis 





Primer 


Location 


Primers (5'-3') 


PGR product 


Conditions 


Annealing 










size (kb) 


MgCl2 


temperature (*C) 


1 


EXI03L 


Exon 9 


CTGCCATGGTGTGGACGGTT 


170/173 


1.5 


60 




EXi02R 


Exon 10 


CTCCAGCTCCTCCCCTGGG 








2 


ADC22R2 


3'tn-R 


G6ACAAAGCATTGG6AACACT 


408 


1.5 


56 




ADC22L5 


3'IJTR 


TGAGGGACATGGTGTGCCACAGG 








3 


DMOI014 


3ljrR 


GAACTGAGAAGGTCAGAHA 


26fl 


1.5 


56 




DMOI009 


3'tJTR 


ACAGGACTCATCTAHTATCCATA 








4 


DMOI018 


3'UTR 


ATTGGATGCAGTT6TATAGT 


229 


1.5 


56 




DMOI02I 


3'UTR 


TAATCTGACCnCTCAGHC 
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From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except a.s by the appended claims. 
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Claims 



We claim: 

1 . An isolated nucleic acid molecule encoding an AD4 gene product. 

2. The isolated nucleic acid molecule of claim 1 wherein said molecule is 
set forth in Figure I . 

3. The isolated nucleic acid molecule of claim I wherein said nucleic 
acid molecule encodes a mutant AD4 gene product that increases the probability of 
Alzheimer's Disease. 



4. The isolated nucleic acid molecule of claim 3 wherein said nucleic 
acid molecule encodes a mutant AD4 gene product with an amino acid subsituiion at residue 
141. 

5. The isolated nucleic acid molecule of claim 4 wherein said substiiution 
at residue 141 is an aspargine to isoieucine substitution. 

6. An isolated nucleic acid molecule selected from the group consisting 

of; 

(a) an isolated nucleic acid molecule as set forth in Figure 1 or 
complemcnlary sequence thereof; 

(b) an isolated nucleic acid molecule that specifically hybridizes to 
the nucleic acid molecule of (a) under conditions of high stringency; and 

(c) an isolated nucleic acid that encodes an AD4 gene product. 

7. An expression vector, comprising a promoter operably linked to a 
nucleic acid molecule according to any one of claims 1 to 6. 

8. The expression vector according to claim 7 wherein said promoter is 
selected from the group consisting of CMV I-E promoter, SV40 early promoter and MuLV 
LTR. 

9. The expression vector according to claim 7 wherein said promoter is a 
tissue-specific promoter. 
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10. A vira! vector capable of directing the expression of a nucleic acid 
molecule according to any one of claims 1 to 6. 

11. The viral vector according to claim 10 wherein said vector is selected 
from the group consisting of herpes simplex viral vectors, adenoviral vector^,, adenovirus- 
associated viral vectors and retroviral vectors. 

1 2. A host cell carrying a vector according to any one of claims 7 to 1 1 . 

13. The host cell according to claim 12 wherein said cell is selected from 
the group consisting of human cell, dog cell monkey ceil, rat cell and mouse cell. 

14. An isolated protein comprising an AD4 gene product. 

15. The isolated protein of claim 14 wherein said protein consists 
essentially of an AD4 gene product. 

16. The isolated protein of claim 14 wherein said protein has an amino 
acid sequence set forth in Figure 2. 

17. The isolated protein of claim 14 wherein said protein is a mutant AD4 
gene product that increases the probability of Alzheimer's Disease. 

18. The isolated protein of claim 17 wherein said mutant AD4 gene 
product has an amino acid subsitution at residue 141. 

19. The isolated protein of claim 17 wherein said substitution at residue 
141 is an aspargine to isoleucine substitution. 

20. A method of treating or preventing Alzheimer's Disease, comprising 
administering to a patient a vector containing or expressing a nucleic acid molecule 
according to any one of claims 1, 2, or 6, thereby reducing the likelihood or delaying the 
onset of Alzheimer's Disease in said patient. 

21. A method of treating or preventing Alzheimer's Disease, comprising 
administering to a patient a protein according to any one of claims 14 to 23, thereby reducing 
the likelihood or delaying the onset of Alzheimer's Disease in said patient. 
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22. A method of treating or preventing Alzheimer's Disease, comprising 
administering to a patient an antibody specific for a protein according to any one of claims 17 
to 19, thereby reducing the likelihood or delaying the onset of Alzheimer's Disease in said 
patient. 

23. The method according to any one of claims 20-22 wherein said method 
comprises in vivo administration. 

24. A pharmaceutical composition comprising a vector comprising the 
nucleic acid molecule according to any one of claims 1 and 2, along with a pharmaceutically 
acceptable carrier or diluent. 

25. A pharmaceutical composition comprising a host cell according to any 
one of claims 12 and 13, along with a pharmaceutically acceptable carrier or diluent. 

26. A pharmaceutical composition comprising a protein according to any 
one of claims 14 to 16, along with a pharmaceutically acceptable carrier or diluent. 

27. An antibody which specifically binds to the protein according to any 
one of claims 14 to 19. 

28. The antibody according to claim 27 wherein the antibody is a 
monoclonal antibody. 

29. The antibody according to claim 28 wherein the antibody is selected 
from the group consisting of an Fab fragment, an Fv fragment and a single chain antibody. 

30. A hybridoma capable of producing the antibody according to claim 28. 

31. A pharmaceutical composition comprising an antibody according to 
claim 27, along with a pharmaceutically acceptable carrier or diluent. 

32. A nucleic acid probe which is capable of specifically hybridizing to an 
AD4 gene under conditions of high stringency. 
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33. A nucleic acid probe comprising at least a portion of the nucleotide 
sequence shown in Figure 1 or its complementary sequence, said probe being capable of 
specifically hybridizing to a mutant AD4 gene under conditions of high stringency. 

34. A probe comprising at least a portion of the nucleotide sequence 
shown in Figure 1 or its complemenlar>' sequence, said probe being capable of specifically 
hybridizing to a mutant AD4 gene, in which amino acid residue 141 is changed from 
asparagine to isoleucine, under conditions of very high stringency. 

35. The probe according to any one of claims 32 to 34 wherein said probe 
comprises at least 1 2 nucleotide bases. 



36. A pair of primers capable of specifically amplifying all or a portion of 
a nucleic acid molecule according to any one claims 1 to 6. 

37. The primer according to claim 36 wherein said primer comprises at 
least 12 nucleotide bases. 



38. A method of diagnosing a patient having an increased likelihood of 
contracting Alzheimer's Disease, comprising: 

(a) obtaining from a patient a biological sample containing nucleic acid; 

(b) incubating the nucleic acid with a probe which is capable of 
specifically hybridizing to a mutant AD4 gene under conditions and for lime sufficient to 
allow hybridization to occur; and 

(c) detecting the presence of hybridized probe, and thereby determining 
that said patient has an increased likelihood of contracting Alzheimer's Disease. 

39. A method of diagnosing a patient having an increased likelihood of 
contracting AlzJieimer's Disease, comprising: 

(a) obtaining from a patient a biological sample containing nucleic acid; 
(h) amplifying a selected nucleic acid sequence associated with a mutant 

AD4 gene; and 

(c) detecting the presence of an amplified nucleic acid sequence, and 
thereby determining that said patient has an increased likelihood of contracting Alzheimer's 
Disease. 
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40. A method of diagnosing a palienl having an increased likelihood of 
contracting Alzheimer's Disease, comprising: 

(a) contacting a biological sample obtained from a patient with an 
antibody that specifically binds to a mutant AD4 protein, under conditions and for a lime 
sufficient lo allow binding of the antibody to the protein; and 

(b) detecting the presence of the bound antibody. 

41. A peptide vaccine comprising a portion of a mutant AD4 gene product 
containing a mutation, in combination with a phanuaceutically acceptable carrier or diluent. 



AD4 gene which is operably linked lo a promoter eflective for the expression of said gene, 
said gene being introduced into said animal, or an ancestor of said animal, at an embryonic 
stage. 

43. The transgenic animal according to claim 42 wherein the animal is 
selected from the group consisting of a mouse, a rat and a dog. 

44. The transgenic animal according lo claim 42 wherein AD4 is 
expressed from a vector according to any one of claims 7 to 1 L 

45. The transgenic animal according to claim 42 wherein the AD4 gene 
encodes a mutant AD4 gene product. 

46. An isolated DNA molecule comprising AD4 gene sequence presented 
in Figures 13 to 19. 



42. A transgenic animal whose germ cells and somatic cells contain an 
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FIGURE 1A 

The coding region for AD4 begins a scare codon beginning at 
nucleotide 3fi8 gcc(acg)ctc and ends with a stop codon beginning 
at nucleotide 1712 «tc(tga)ggg 

1 cgagcggcgg cggagcaggc atttccagca gtgaggagac agccagaagc 
51 aagcttttgg agctgaagga acctgagaca gaagctagtc ccccctctga 
101 atCttactga tgaagaaact gaggccacag agctaaagcg acttttccca 
151 aggtcgccca gcgaggacgt gggacttctc agacgtcagg agagtgatgt 
201 gagggagctg tgtgaccata gaaagtgacg Cgttaaaaac cagcgctgcc 
251 ctctttgaaa gccagggagc atcattcatt tagcctgctg agaagaagaa 
301 accaagtgtc cgggattcag acctctctgc ggccccaagt gtccgtggtg 
351 cttccagagg cagggctatg ctcacattca tggcccctga cagcgaggaa 
401 gaagtgtgtg atgagcggac gtccctaacg Ccggccgaga gccccacgcc 
451 gcgctcccgc caggagggca ggcagggccc agaggacgga gagaacactg 
501 -ccagtggag aagccaggag aacgaggagg acggcgagga ggaccccgac 
551 cgctatgtcc gtagtggggt tcccgggcgg ccgccaggcc tggaggaaga 
601 gctgaccctc aaatacggag cgaagcacgt gatcacgctg tttgcgcctg 
651 tcaccctgtg catgaccgtg gtggtagcca ccatcaagcc tgcgcgcttc 
701 tacacagaga agaatggaca gctcatctac acgccactca ctgaggacac 
751 accctcggtg ggccagcgcc tcctcaaccc cgtgctgaac accctcatca 
801 cgatcagcgt caccgcggtc acgaccatct tctcggtggt gccctacaag 
851 caccgctgct acaagttcat ccatggctgg ttgatcatgt cttcactgat 
901 gctgctgttc ctcttcacct atatctacct Cggggaagcg ctcaagacct 
951 acaatgtggc catggactac cccaccctct tgctgactgt ctggaacttc 
1001 ggggcagcgg gcatggtgtg catccactgg aagggccctc tggtgctgca 
1051 gcaggcctac ctcatcatga ccagtgcgct catggcccca gtgttcatca 
1101 agtacctccc agagtggtcc gcgtgggtca tcctgggcgc catctctgtg 
1151 tacgatcccg tggctgtgct gtgtcccaaa gggcctctga gaatgctggt 
1201 agaaactgcc caggagagaa atgagcccat attccctgcc ctgataCact 
1251 catccgccat ggtgtggacg gttggcacgg c^aagctgga cccctcctct 
1301 cagggtgccc tccagccccc ctacgacccg gagatggaag aagactccta 
1351 tgacagtctt ggggagcccc cataccccga agtctccgag cctcccttga 
1401 ctggczaccc aggggaggag ctggaggaag aggaggaaag gggcgcgaag 
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FIGURE 1B 



1451 cctggccccg gggactccat cttccacagt gcgctggtgg gcaaggcggc 

ISOl cgccacgggc agcggggact ggaataccac gccggcctgc cccgcggcca 

15S1 tccccaccgg cctgcgtctg accctcctgc cgcccgccgt gcccaagaag 

1601 gcgctgcccg ccccccccac ccccatcacg cccgggccca tccctcactt 

1651 ccccacggac aacctggcgc ggccgtccat ggacaccceg gccccccacc 

1701 agccccacat ctgagggaca Cggcgtgcca caggetgcaa gctgcaggga 

1751 accttcatcg gacgcagtCg tacagcctca caccccagtg ccacatatCC 

1801 tcaagacctc tcctCcctta aaaaataaag tacgtgctca ctcggtgagg 

1851 aggaggcaga accagcKCC tggtgccagc tgttccacca ccagaccctg 

1901 gcccccgccr tggggagcgc ctcgcc=cac ggacaggaag cacagcaggt 

1951 tcacccagac gaactgagaa ggccagacta gggcggggag aagagcaccc 

2001 ggcacgaggg ctgagatgcg caaagagcgr gctcgggagc ggcccccggc 

2051 acccgggtgc cccggctgga gaggaaaagc cagtccccca cgaggagcgt 

2101 tcccaaegcc C=gtccacga cgcccccgcc act=caccgc cccnagaaac 

2151 cgag=cccgc cctcgccacg gcagtcacac Cgccgggaag Cggctcaata 

2201 gcaatatcaa Caaatagatg agtcc=gcca gaaaaa 
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FIGURE 2 



Protein sequence of 448 amino acids rh=.i- _ 

of Che Human AD4 gene. '^^P^sencs the gene prcduc: 



1 K.TFMASDSE EEVCDERTSL MSAESPTPRS CQEGRCJGPED CENTAQWRSQ 

SI ENEEDGEEDP DRWCSOTPG RPPGLEEELT LKYGAKHVIM LFVPVTLOU 

XOI VWATIKSVR FVTEKNGQLI YTP5TEDTPS VGQRIXMSVL WTLIMISVIV 

ISl VMTIFLWLY ICraCYKFIHG WLIMSSLMLL FLnTIYLGE VLKTVNVAMD 

201 YPTLIXTVWN FGAVGMVCIH WKGPLVLQQA YLIMISADlA LVFIKYlPEW 

2S1 SAWVILGAIS VTOLVAVLCP KGPIJUa,VET AQEHNEPIFP ALIYSSAMVW 

301 TVGMWOJJPS SOGALOLPTO PEHEEDSYDS FGEPSYPEVF EPPLTX^PGE 

3S1 ELEEEEEHGV KLGUSDFirV SVLVGKAAAT GSGDWKTTLA CFVAILIGLC 

401 r.TIiLLAVFK KALPALPISI TFGLIFYFST DNLVRPFMDT lASHQLYI 
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FIGURE 3 



1 MLTFMASDSEEEVCDERTSUISAESPTPRSCQEGRQGPETCEOTACW^ 50 

•" ' • • I • I I • I I • ! • I : I • - I I 

1 KTELPMt^Q. .NAQMSEDimLSJmmSCJNDHRERQEHND. . . .mSL 44 

SI BSIEEBCSroPDRYVCSGVPGRPPGL^ 100 

„'""""•'" I- IIIIK-.ltll |:|:|| 11 I I III 

' ' I I ■ ' I I N I I I I I . : : I : I I I : I . : I I I I I I . I II I I I . I 
201 yPTUXTVWOT«»yG^p>CI^^ 2S0 

245 TAWtllAVISVYDLVAVLCPKGpi^^^Ai^ 294 

AD4 301 TVCaAKLDPSSQ. .GA£QLPyDPEMEE. . . .DSYDSFGEPSVPEVFEPPL 344 

I'll- ll-'l . .|::| -I |. :- ■- I 

295 I-VNMAEGDPEAQRRVSKNSKTOAESTERESQDTVAZNDIX^ 344 

345 TGTPG eeL EEEEERGVKLGLGDFIFYSVLVG 375 

345 DSHLGPHRSTPESFUAVolLssitJlG^pilii^^ 394 
376 K'^TCSGEn^ 25 
395 KASATASGDWtrrriACFVAILIGLCLTiljLii^ 444 
AD4 426 ryFSTDNLVRPFMDTIASHQLYI 448 

MM! IMlil II ||:|| 

AD3 445 FYT-ATDYLVQPFMDQLAFHQrYI 467 



A04 

AD3 

AD4 

AD3 

AD4 

AD3 

A04 

AD3 

AD4 

AD3 
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FIGURE 5 



Scquance features of the Human AD4 



gene product 



1 MLTFMASDSE EEVCDE31TSL MSAESPTPRS CQEGRQGPEa) GENTAQWRSQ 

51 ENEEDGEEDP DRTVCSGVPG RPPGLEEELT LKYGAKHVIM LFVPVTr.CMI 

101 VWATIKSVR FYTEKNGQLI rTPFTEDTPS VGQRIXNSVL NTLIMISVTV 

151 VKTIFLVVLY KYRCYKFIHG WLIMSSLMLL FLFTYIYLGE VLKTYWVAMD 

201 YPTLLLTVWM FGAVGMVCIH WKGPLVLQQA YLIMISALMA LVFrKYLPEW 



251 SAWVILGAIS VYBLVAVLCP KGPLRMLVET AQERNEPIFP ALIYSSAMVW 



301 TVGMAKLDPS SQGALQLPYD PEMEEDSTOS FGEPSYPEVF EPPLTGYPGE 
351 ELEEEEERGV KLGLGDFIFY SVLVGKAAAT GSGDWNTTLA CFVAILIGLC 



401 LTLLLLAVFK KALPALPISI TFGLIFYFST DNLVRPFMDT LASHQLYI 



Conserved tetrauner (KGPL 222-225) 



Conserved tetramer {KGPL 271-274) 



Putative iGlycosylacion Site (N 386) 
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DOTPLOT COMPARISON OF THE PRODUCTS OF HUMAN AD4 AND AD3 GENES 



HUMAN AD4 GENE PRODUCT (l TO 448) 




Fig. 6 
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FIGURE 9 



D1S103 
D1S213 

D1S227 
V1S22S 
D1S32C 
D15439 

D1S479 



ACCSUlCiTTCTACUGTTAC 
TCrcaGAGaAACTGacCTGT 

AGCrCrrTAATCGUTCTATGArCTG 

G^CTGGGTGACaAAGCA 
AGCTOTGTCGGGTCTTGA 

GaGcaArrrcAGaATCTTGc 

GCT i r-CaCTEATGGTG 
ACrCTAGrTGTGTGTGAArGTATc; 

TrTTGGCACCAGTAATATCrTG 
TTGCAGACTCTC^CrraGAACl 

Ca^CA GACTTC^TTAGAGGGG 
GTTGAAATGGTGAATTTGGA 

agctggggtgtttt;itggm 
ggggaccctstttctgcrac 

ACCCATTGCcriCCATC 
CK^GGAGArrTGGACTGG 
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FIGURE 10 

dna sequences 

788IIH5.SEQ 

1 AGTGAAATCG TCCTGTCACC ACGCGTCAAG TGCTGATGGG GCACAGCAAC 
SI rrCCGGGCCT ATCATATCTC CTTGACCTCG TCCCTCAAAT CTCGTACTTT 
101 CTGCACCGAG GGACACAGTC CACTGCGATG AAGTATGTTC AAAATC™ 
ISl TCTATAGGCA GGTCCTTCCA AAGTCCAATA GTCCAAGGTG GACGcS 

EST T03796 

1 '^GAAGGTAA GAGCTGACCC TOUUTACGG AGCGANACGT NATCATGCn^ 
51 TTTGTGCCTG TCACTCTGTG CATGATCGTC GTGGTAGCCA CCATCAAGTC 
101 TGTGCGCrrC TACACAGAGA AGAATGGACA GCTCATCTAC ACGCCATTCA 
ISl CTGAGGACAC ACCCTCGGTG GGCCAGCGCC TCCTCAACTC CGTGCTGAAC 
201 ACCCTCATCA TGATCAGCGT CATCGTGGTT ATGACCATCT TCTTGGTGGT 
251 GCTCTACAAG TACCGCTGCT ACAAGTTCAT CCATGGCTCG TTGATCATGr 
301 CTTCACTCAT GCTGCTGTTCr CTCTTCACCT ATATCTACCT TGGGGAAGTG 
3S1 CTCAAGACCT ACAATGTGGC CATCGACTAC CCCACCCTCT TGCTGACTGT 
401 CTCGGAACrr CGGGGCAGTG GGCATGGTGT GCATCCACTC AAGGGGCCrr 
451 TGGTGCTGCA GAGGCCTACC TATOv 

ELLLl DNA sequence 
ELL1.2 DNA Sequence 



agtgaaccagtagagatggggggaS^^tSSSJS??^;:^^^ 

GOUWATGACCTX^GGGGAACTCCTrGGA^CT^S^^S^*!??^ 
TAGAATTTGTGGCGCTTGGGGACACCTTGTGATCGNGMAATTTCNGTTGTCrAGNNCATcSS 
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FIGURE 11 



PCR PRIMERS 



WWF 

WWR 

EPIF 

EPIR 

WWFIL 

WWRIR 

1.3F 

1.3R 

EP2F 

EP2R 

921D12F 

921D12R 

INTIR 

ADC22L1 

ADC22R1 

KM7 749 

KM77S0 



GGA 
CAA 
CTT 
GTA 
TCC 
AGC 
ACC 
AGG 
GAG 
TGA 
TCC 
CAG 
GTG 
CCC 
ACT 
GGT 
TCA 



CAC ACC 
GGT AGA 
GGT GGT 
GTC CAT 
TCA ACT 
ATC AGT 
ACC GGG 
AAG CAG 
AAG GTA 
TAG GTA 
AGC CAC 
TAT TAG 
GGG CAG 
AAG TGT 
GTG GAC 
AGA TAT 
CTG AGG 



CTC 
TAT 
GCT 
GGC 
CCG 
GAA 
GCA 
ATC 
AGA 
GGC 
CCT 
GTC 
ACG 
TCG 
ATA 
AGG 
ACA 



GGT 
AGG 
CTA 
CAC 
TGC 
GAC 
ACC 
TCA 
GCT 
CTC 
TCT 
ATC 
GAG 
TGG 
GCG 
TGA 
CAC 



GGG 
TGA 
CAA 
ATT 
TGA 
ATG 
TGA 
ATA 
GAC 
TGC 
CTC 
CAC 
AGA 
TGC 
GTC 
AGA 
CCT 



CC 

GTA C 
GT 
AC 
ATC 

GGA 
CC 
AGC 
C 

GGT 

A 

TT 

AGG 

GG 

CG 
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FIGo 12 



BNSOCCIO: <WO 9703192A3JA> 
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FIGURE 13 

Genomic sequence of stm.genA 

gccgccgagccccggctgggggtgggcgcggcgcgagcggttaaagggccggtglal2S 
aaggagcggtgcacgtgggtctctgaggcgtgtagcaggcgggggIgftt?gt?cttcS 

^^^^^^^^^^'^^''^'^^ggggctctgcccgcacg^gagggglaggglc 
gcaggctcaagcctagagccggtttctgttagcagcggtgtttggctgttttatcllaca 
55^?f*9cagtgaggagacagccagaagcaagctlt?|gIgct|!aggaacSgaSSg 



^ff^^^r r^''''^^^''"^^'=^9^^9*«9aaactgaggccacagagctaaagtga 
gcttctagatctgctgtccatccctccgagcagacctcacccctgtttattglcttaati 



aS^^o^^S^^^^^"''!''"''^^^^^=9^^^9^^g«'=t^«'=tt:aactcagaagcgac 
agcctggggttgccaaggctgtctacgaagtaacttgattaggaccgacccclgcitica 
gtaaggaagcctctgatgcctctgtagccaattctgcagacaLtglgcctccaaggc" 
tcagccaagacctttggcggtaattggagtctcgggataagctgcltlaggtgtgtg^ 

«IIfr^^^^ ^^''''f''^'='=^=^5^5ctgcgtcctcacgacgccc:acgt?gagclt:t 
!?atat^al^?^^?^^^''''^^^^'''^'^9'="99gggcai:gttgctttgalgtgggatgg 
atgtatcaggtttttggggaaaactctgtaccctttggtgttgaagtcccatgtgica 



.9703td2A3JA> 
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FIGURE 14 



gaacgtagtctatgctggacacttcctteto^^^Z^ f f^^^^^^^^^^'^^ctttcca 
gagaatggggtgclgglggtjqqaaacaool^r 

gcgtcaccggcacLtaat5tqSc!aa?«^^r^^^''''''^'='='''="=5"'ggagagtt:catc 

gggl:ggtg!?ttttcc"ccaI|g"ScaSa^ 
gtggcttgcaaagcaggccctcIggaaSlafoa^^^ 

ctgttttccttcaggagctaggtaaat«aaIo«2^^^ ^"^^'==^*='===^9atgct 
agcctttgagtccIltctgctSccJanr^!!?^^^^^^^*=^^^^^^^^^"gtt 

t:agaggaL!tctgtcS!a^gSg"g?tlgtt!t«^ 

aaaataaaccctacagataca ^ ^'^^'^^ttattttcctttaaagaattaatagcct 
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FIGURE 15 

caccacagatgtttgctgatcgaatgaatgagcacactgacagtttggagctgccctgac 

tttcgtggctatgcgttttgccccctgggatgtgagtcacctcaggccagccccaggcaa 

ggccgctgctgcctccatggtaactctcaaggcctcttgttttatggcagtcgtttgatt 

gacaggcatctcttggaagcttttggggcaggacttgtgtccaagtctccaggtcgcctc 

cagccaccccctgagtcctccactgcctttgtctcacaggaaagtggaacaaggtccttg 

tgctcctttttccaggtgcttccagaggcagggctatgctcacattcatggcctctgaca 

gcgaggaagaagtgtgtgatgagcggacgtccctaatgtcggccgagagccccacgccgc 

gctcctgccaggagggcaggcagggcccagaggatggagagaanactgcccagtgggtag 

gtcccaccagcagctgggggccttcaaacaggtcctgcggtrtactgtaccttacagatga 

aaaccagacattcattccctgatgcgggagggagaagggaagtaatgatgaggattggcc 

gaaaaggtgggtggctggccatgatggaccttccatctgcagggtttcataggactgcgc 

attcacagccagagatggacttggcagtgggctgaagacgctgtccactctgccaccttg 

ggtttacctctctcatgcaggtcactgtttccactgtaataggagagtttgtttggatgc 

ctgggtgctaggacaggtaacacagaagcttaggatggtagcaggggaagcatcttttgg 

cagatggccagacatggtaagtgtgagaggagtctgcctgatacacgattgacttttgag 

ctggggatatttgggcttcactgtgatcattcagcccccaggggaggagattgtaacgtt 

agaaagagtaggatatcgttgggagagccacttagttgtgtcctttctctcccgatcagg 

gcagaacatctgaatttgcctgaaccctgttctctgttttgcccattatagaattaaaaa 

atgtctctgtgtggactgtttttttgcagccagtcttaatcctgcttgctgaaatttgag 

ctcacttctccatgttctccttgagaacggaaccatcgtccctaagccctgagtgaaatc 

acaccagcttaaggccactgctctgccactcctcagccttttcttgtttgttatctccgg 

gaagttttgtacactttggttgtttcagtttctgttcatgagtagtcttctttcttggct 

gaacgtctagattgggactctctctgcagagaaccggtactgaagcaactgtcattttca 

gtttttgtttcatttggctttttctttagctgttcaccttattagcaaggcagcccatga 

ccttgacttgccacagttccaaaacacaaattcrttacagatcggtttgtgctagtgtctg 

gcaggtgtcctgccctccctcgttacctcctcatttgtgcctgcccaccttcccagagcc 

tgcgtcttctcagatgcttaacacctgtttagcctctctagttcagagctacaaatttac 

atgcttgattctgtggggcagaaagttcaaagtaatttcttcctctgcaaattcccagta 

tcttagtcacacgcaaagagagtgtccctgtgcactgactcctctagctagtgatttgtc 

agccaaaaatgtttatttatctcctggcctgtctcctcccatatcagtatggccacatga 

acagaattgagtgacctcctgagtccctgtattaggaaggggaaagatcttttgattcat 

taaccattaagttgattcattaaccattaagtcttgggcctgcagaccatagcaaccttc 

cttccttcatttatggtgcttcatccagctccaaatcttctctactttgtcctcacaaac 

ttttcatatgccctagtagctcatagactgctccttatatctggaaagcaacattcaaac 

ttctcatttctggttccaaaaatccgtgcattacatggataggctgccntgggggacatt 

cngcggcccccacgatgtggtttcccacagagaagccaggagaacgaggaggacggtgag 

gaggaccctgaccgctatgtctgtagtggggttcccgggcggccgccaggcctggaggaa 

gagctgaccctcaaatacggagcgaagcatgtgatcatgctgtttgtgcctgtcactctg 

tgcatgatcgtggtggtagccaccatcaagtctgtgcgcttctacacagagaagaatgga 

cagctgtgagttggggggctggggggagcagggtggggtgagggctgagttgccaggggg 

tggggggcgcagcagcctgtgttggtcactgtacctgcagctccacaccagcagcggtaa 

agagcagggatgaagaaccgcccaggttcatggcctggctcactgcctcctggattgtga 

cctacttgggcatgcttttaacatccctatgcctcagcttccttgttcgtataatgggtt 

gataacgcagttactgggagaattaagtgagtta^tatgagtgaagggcttagaagagtg 

tctactgcacgtgagtgctcaggcaagctggatcctgctgcagaaagcaagctcttgatc 

ctgggcatggctgtgccactgatccctgtgtgactgcaaacaaatcacttcctctctgag 

tctctgcttccctgaatgtgaaacaagqtggttggaccagatatttctcagctcacttcc 

agccttgtgaggaagacttataaagcctttcgtttattttagtaaaatacatgcagaggc 

agcagcgtagaaaaatgagaagcttcctccacttcttccccctcccctttctgtggtcct 

cactgctaagcaccttctgtaaacttttttttttttttttaaagttaggganttttgttt 

catttcgtgtgtgttggttttttttgttgttgttgtttcttttaaagaaaggaataaggc 

caggtgtggtgtctcatgcctgtaatcccagcactttgggagactgaggtgagaggatta 

tttgagcccaggagttcgagaccagcctgggaaatgtggcgagaccctgtctgtacaaaa 

aatgcaaaaattagccaggtgtggtggtacatgcctgtagtctcagctacttgggagact 

gaggtggaagaacacctgagcccagaagtcgaggctgcagtgagccatgattgcgccact 

gcactgcagcctcagcaacagagtgagaccctgcctcaaaatttttttaa 
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FIGURE 16 

Genomic sequence of stm.genD 
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FIGURE 17 

Genomic sequence of stm.genE 
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FIGURE 18 

Genomic sequence of stm.genF 



taggcattgggggacccagcaggaLgaagaccfcaqSatcccaoon^ 



ggtagtgccaOTactgtcatgagLlcSJmc?!™™?? 
ggtcttggctctgaJgaccLgticttatllS^ff'''' ""^"^'''""''Sa 

at^gacaaatgc!a,a|agag?|acSM5aSc«S!''''''"""^ 
a,catgag=a,atacctgIa|gnS'tc«S|cJS?t?2?"^^^^ 

gX=:SSea»,%~^^^^^^^ 
i1?lSc?S-|a%~^ 

S!|a?S?l?L1™'^^^ 
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FIGURE 19 

Genomic sequence of stm.genG 

cagggtctcgctctgttgcccaggctggagcgcagtagcacgattgtggctcactgcaaccctgacttcctqggc 

cc.agtgatcctcccaccttggcctcctgagtagctgggattacaggcacgtgctaccacacctggctcatttat 

atttttagtagagacaaggttttgacttgtrgcccgggctggtttcgaactcctgggctcaagtgatccacctgc 

cttggcctrccaaagcactgggattataggtctgagccacagcacctggccaaaattagtcttattataagagat 

gcaactcaggaccagccaaatgaagagacagtgaagaagtaatgctgatggatcacacctggtgggggaaggcgg 

acagctggggccaggagcaggagggacacctgcagggctggaagggcaggggaggtgggcccccatggtttgtgt 

ttattgcataaccatttttattgtctacagtgagcaaagttatcctataaacaagtgtcagggaccattgcaeta 

aagaaaacaaatgagagcattttggaagctctaatttcctgatcagtaatgggtagactaattcccagttatatt 

tacctgttgtaaggtgaaaggttcttcagaggacctctgtcttggtgttatatgggcttttgaatgtactqaaat 

caaattccctaaaaanctgtgattcagacttcatactaaattgtacagcagtgcccagcccaaggccttgcattt 

ctatttgttgttttctttactctctaagtgcccaacactggtttrtacctgagtttcagaactgcccgcttttctc 

tgcccaggrtgtaagtcacccagtccacaggtgtcccctgctttcccactggccactgatttggggaggcaqctq 

tccatgtccccagrccacatcttagcttctagaggccaggtggggtgggctgggctgggcaaglgcagccgggcc 

ttictgggccagagtttctcttctttttccatcctgtgcacgcctcttcagtacgggtt.ctgtctctcctc^ac 

agggggcgtgaagcttggccteqgggacttcatcttctacagtgtgctggtgggcaaggcggctgccacgggcag 

cggggactggaaraccacgctggccrgctrcgtggccatcctcattgtgagtggctggggatgcgtccaactgcc 

tcgtggtgggggeccccagggtcctcatcgtggtgggggcaggtctcaggatccctagggatttttcatttcttc 

tcttccctctgagggacaagagcagggagcggggctggaagggtcagcttgagaccaaggctcacaggaggtgtg 

ctcgcccctaggtgggccccagcctgcggaggacagcgcaggggagggtgaggagtgtaccggccccagcgtggc 

tgagcacacagcctccaggccgaggacccagccgacagcrttgcgcagtgatgataccctcgaggtggttgtgat 

gacatcagatctgc-gaaaagaaaattgcttaagggccttgcccatgggcgcaaagctagtgaggaccatgtttt 

ccccctcctccatgccattgggacaccacagggtetgaatctggggcactaggggtggccecgttactgtgaacc 

acagcagtgaaangtggagqccctgtagtcagttaacgtggccagatacacacaatggggagacgtcctgccgtg 

acttcatctcagagatttcgctgtcacgttagaggaggaggagcgtctgagccgtgcgcttggcatctgcccctt 

agtgaaaaccctgggcatggcatgattaaggttgatgctccagtgtccagaaggttttctttttgcccacaagra 

tatcagggatgggatggtggacccaggctcctccaccaccagactgccttacctgagccctgctggccccaaaga 

ratagaaggcaccctggttccctgtgctcacctggaccactgcctgcatcagctgggtcaggggaggatgggcag 

!!!"!!r"^r"""''''"''"5'="9'=99"-'9"'«="ggtgccagc4«gagaac^^^ 

tgccatttcagttgagcccacctagetctcatataaatacatgttccctgagggcatcttaccatcccatgtgac 

cactccagccagacaggggaggcagcacggcctcggggcacagcactgctccaggagtcaggaggcctgccttct 

ggtccactcactaacaggtgaggtgacctaatgggggtgagaacttctgcccctaacacctcaagagctgctqca 

ggaccagqgaagataatggggtgtctagcgccgttatccgaccggtcctcgaacaagctcctgtgcccagggact 

agaccatgactcacagctcctgtccacaccagggatcaccacgctcaccctcccctccatgtcctgcagggcttg 

tgtetgaccctcctgetgcttgctgtgttcaagaaggcgctgcccgccctccccatctccatcacgttcgggctc 

atcttttacttctccacggacaacctggtgcggccgttcatggacaccctggcctcccatcagctctacatctga 

gggacatggcgtgccacaggccgcaagctgcagggaattctcattggaegcagttgtatagttctacactctagt 

'""""r""'""""""""*''"""9'^"9tgtctacttggtgaggaggaggcagaacc. 

gctctttggtgccagctgtttcatcaccagactttggctcccgctttggggagcgcctcgcttcacggacaggaa 

gcacagcaggtttatccagatgaactgagaaggtcagattagggcggggagaagagcatccggcatgagggctga 

gatgcgcaaagagtgtgctcgggagtggcccctggcacctgggtgctctggctggagaggaaaagccagttccct 

acgaggagtgttcccaatgctttgtccatgatgtccttgttattctattgcctttagaaaccgagtcctgctcct 

gttacggcagtcacactgctgggaagtggcttaatagtaatatcaataaatagatgagtcctgtcagaatcttgg 

agtttggtccgttgtaaatgttgacccctctccctgcatc^gggcacccctgggataacttgtgctgtgagccc 

-ggatggaggcagtttgccctgtttgaaggaacttttaatgatctcgcctctctgcacacatttctttaactaga 

aagtttcctaageaaaggagttaggagagcagggtggcctgacatctgccagccctgagctgtaaggctgtggat 

gctgagcaggtccctggactcagttgtgcacggtggcacagacactgccaggtggttgccaaaacatccagtggt 

tec tcagcaagtgttcaccctctgcagaagcctgtgagggcctgagctcagaaaccactctcctttccttctct 

ggctttggccctgggcactgtggtgggagagtggacagtttggctttgccetctctgtacatcaatcatgggttg 

caaagagaatctcagaagtgcccctccctgagcacagtggctcacacctgtaatcccaatacttcgggaggtcga 

gtcggaaggatcacttgagcccaggagtttgagaccagccggggcaacatagtgagaccntgcacaaaaaaaaat 

ttaaaaattagccaagcatggtggcatgcatctgtagtctcagctactctggaggctgaggtgggaggatcactt 

aagcccaggaggctgaggctgcaatgagccgagatcaagcaggtgttaggtatatcagacagctgagaagacgca 

.gtgtgccctggggttcaaactggtacccctgtctccctgttccaggaataacatgagtgccgggacaatgcatc 
tttattatgagaggaatigagaattgt ^ ^ -a^-^ y 
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